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Description 

Technical Field 

5 [0001] The present invention relates to a method and apparatus for interleaving a bitstream for use in an authoring 
system for variously processing a data bitstream comprising the video data, audio data, and sub-picture data consti- 
tuting each of plural program titles containing related video data, audio data, and sub-picture data content to generate 
a bitstream from which a new title containing the content desired by the user can be reproduced, and efficiently recording 
and reproducing said generated bitstream using a particular recording medium. 

10 

Background Art 

[0002] Authoring systems used to produce program titles comprising related video data, audio data, and sub-picture 
data by digitally processing, for example, multimedia data comprising video, audio, and sub-picture data recorded to 

'5 laser disk or video CD formats are currently available. 

[0003] Systems using Video-CDs in particular are able to record video data to a CD format disk, which was originally 
designed with an approximately 600 MB recording capacity for storing digital audio data only, by using such high 
efficiency video compression techniques as MPEG. As a result of the increased effective recording capacity achieved 
using data compression techniques, karaoke titles and other conventional laser disk applications are gradually being 

20 transferred to the video CD format. 

[0004] Users today expect both sophisticated title content and high reproduction quality. To meet these expectations, 
each title must be composed from bitstreams with an increasingly deep hierarchical structure. The data size of multi- 
media titles written with bitstreams having such deep hierarchical structures, however, is ten or more times greater 
than the data size of less complex titles. The need to edit small image (title) details also makes it necessary to process 

25 and control the bitstream using low order hierarchical data units. 

[0005] It is therefore necessary to develop and prove a bitstream structure and an advanced digital processing meth- 
od including both recording and reproduction capabilities whereby a large volume, multiple level hierarchical digital 
bitstream can be efficiently controlled at each level of the hierarchy. Also needed are an apparatus for executing this 
digital processing method, and a recording media to which the bitstream digitally processed by said apparatus can be 

30 efficiently recorded for storage and from which said recorded information can be quickly reproduced. 

[0006] Means of increasing the storage capacity of conventional optical disks have been widely researched to ad- 
dress the recording medium aspect of this problem. One way to increase the storage capacity of the optical disk is to 
reduce the spot diameter D of the optical (laser) beam. If the wavelength of the laser beam is I and the aperture of the 
objective lens is NA, then the spot diameter D is proportional to l/NA, and the storage capacity can be efficiently im- 

35 proved by decreasing I and increasing NA. 

[0007] As described, for example, in United States Patent 5,235,581 , however, coma caused by a relative tilt between 
the disk surface and the optical axis of the laser beam (hereafter "tilt") increases when a large aperture (high NA) lens 
is used. To prevent tilt-induced coma, the transparent substrate must be made very thin. The problem is that the me- 
chanical strength of the disk is low when the transparent substrate is very thin. 

40 [0008] MPEG1 , the conventional method of recording and reproducing video, audio, and graphic signal data, has 
also been replaced by the more robust MPEG2 method, which can transfer large data volumes at a higher rate. It 
should be noted that the compression method and data format of the MPEG2 standard differ somewhat from those of 
MPEG1. The specific content of and differences between MPEG1 and MPEG2 are described in detail in the ISO-1 1 1 72 
and ISO-1 381 8 MPEG standards, and further description thereof is omitted below. 

45 [0009] Note, however, that while the structure of the encoded video stream is defined in the MPEG2 specification, 
the hierarchical structure of the system stream and the method of processing lower hierarchical levels are not defined. 
[0010] As described above, it is therefore not possible in a conventional authoring system to process a large data 
stream containing sufficient information to satisfy many different user requirements. Moreover, even if such a processing 
method were available, the processed data recorded thereto cannot be repeatedly used to reduce data redundancy 

50 because there is no large capacity recording medium currently available that can efficiently record and reproduce high 
volume bitstreams such as described above. 

[001 1 ] More specifically, particular significant hardware and software requirements must be satisfied in order to proc- 
ess a bitstream using a data unit smaller than the title. These specific hardware requirements include significantly 
increasing the storage capacity of the recording medium and increasing the speed of digital processing; software 
55 requirements include inventing an advanced digital processing method including a sophisticated data structure. 

[0012] Therefore, the object of the present invention is to provide an effective authoring system for controlling a 
multimedia data bitstream with advanced hardware and software requirements using a data unit smaller than the title 
to better address advanced user requirements. 
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[0013] To share data between plural titles and thereby efficiently utilize optical disk capacity, multi-scene control 
whereby scene data common to plural titles and the desired scenes on the same time-base from within multi-scene 
periods containing plural scenes unique to particular reproduction paths can be freely selected and reproduced is 
desirable. 

5 [0014] However, when plural scenes unique to a reproduction path within the multi-scene period are arranged on 
the same time-base, the scene data must be contiguous. Unselected multi-scene data is therefore unavoidably inserted 
between the selected common scene data and the selected multi-scene data. The problem this creates when repro- 
ducing multi-scene data is that reproduction is interrupted by this unselected scene data, 

[0015] The object of the present invention is therefore to provide a data structure enabling discrete scene data to be 
10 seamlessly reproduced, i.e., without intermittence in the data presentation, even when reproducing such multi-scene 
data; a method for generating a system stream having said data structure; a recording apparatus and a reproduction 
apparatus for recording and reproducing said system stream; and a medium to which said system stream can be 
reproduced and from which said system stream can be reproduced by said recording apparatus and reproduction 
apparatus. 

15 [0016] A further problem can be expected when the multi-scene data is multi-angle scene data, i.e., scene data 
showing substantially the same subject from different angles. In the case of a live sports broadcast, this multi-angle 
scene data may be obtained by recording a baseball batter, for example, with cameras in different locations. The 
problem is thus that when the user selects one of these plural angle scenes during data presentation, data from different 
angles is connected where the angle changes, and a natural, seamless presentation cannot be expected. 

20 [001 7] Therefore, the object of the present invention is to provide a data structure whereby natural, seamless repro- 
duction without scene data intermitting can be achieved even in such multi-angle scene periods; a method for gener- 
ating a system stream having said data structure; a recording apparatus and a reproduction apparatus for recording 
and reproducing said system stream; and a medium to which said system stream can be recorded and from which 
said system stream can be reproduced by said recording apparatus and reproduction apparatus. 

25 [0018] The present application is based upon Japanese Patent Application No. 7-252734, which was filed on Sep- 
tember 29, 1995. 

[0019] WO 95/121 79 discloses a data block format for software carrier and a player therefor. WO 92/05554 discloses 
a picture retrieval system, a record carrier and a device for use in such a system. Both documents describe a disc 
reproduction apparatus, comprising a means for reading a bit stream as well as a reproduction method for a disc. 

30 

DISCLOSURE OF THE INVENTION 

[0020] In order to achieve the aforementioned objects, a bit stream generating method according to claim 1, an 
optical disc according to claim 4, a disc reproduction apparatus according to claim 8 and a reproduction method for a 
35 disc according to claim 11 are proposed according to the invention. 

Brief Description of Drawings 

[0021] 

40 

Fig. 1 is a graph schematically showing a structure of multi media bit stream according to the present invention, 
Fig. 2 is a block diagram showing an authoring encoder according to the present invention, 
Fig. 3 is a block diagram showing an authoring decoder according to the present invention, 
Fig. 4 is a side view of an optical disk storing the multi media bit stream of Fig. 1 , 
45 Fig. 5 is an enlarged view showing a portion confined by a circle of Fig. 4, 

Fig. 6 is an enlarged view showing a portion confined by a circle of Fig. 5, 
Fig. 7 is a side view showing a variation of the optical disk of Fig. 4, 
Fig. 8 is a side view showing another variation of the optical disk of Fig. 4, 

Fig. 9 is a plan view showing one example of track path formed on the recording surface of the optical disk of Fig. 4, 
50 Fig. 10 is a plan view showing another example of track path formed on the recording surface of the optical disk 

of Fig. 4, 

Fig. 11 is a diagonal view schematically showing one example of a track path pattern formed on the optical disk 
of Fig. 7, 

Fig. 12 is a plan view showing another example of track path formed on the recording surface of the optical disk 
55 of Fig. 7, 

Fig. 13 is a diagonal view schematically showing one example of a track path pattern formed on the optical disk 
of Fig. 8, 

Fig. 14 is a plan view showing another example of track path formed on the recording surface of the optical disk 
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of Fig. 8, 

Fig. 15 is a graph showing the audio waves resulting from the audio data recorded to the multi-angle data within 
multi-angle scene period, 

Fig. 1 6 is a graph schematically showing the structure of multimedia bit stream for use in Digital Video Disk system 

5 according to the present invention, 

Fig. 17 is a graph schematically showing the encoded video stream according to the present invention, 

Fig. 18 is a graph schematically showing an internal structure of a video zone of Fig. 16. 

Fig. 1 9 is a graph schematically showing the stream management information according to the present invention, 

Fig. 20 is a graph schematically showing the structure the navigation pack NV of Fig. 17, 

10 Fig. 21 is a graph is assistance of explaining a concept of parental lock playback control according to the present 

invention, 

Fig. 22 is a graph schematically showing the data structure used in a digital video disk system according to the 
present invention, 

Fig. 23 is a graph in assistance of explaining a concept of Multi-angle scene control according to the present 
15 invention, 

Fig. 24 is a graph in assistance of explaining a concept of multi scene data connection, 
Fig. 25 is a block diagram showing a DVD encoder according to the present invention, 
Fig. 26 is a block diagram showing a DVD decoder according to the present invention, 

Fig. 27 is a graph schematically showing an encoding information table generated by the encoding system controller 
20 of Fig. 25, 

Fig. 28 is a graph schematically showing an encoding information tables, 

Fig. 29 is a graph schematically showing an encoding parameters used by the video encoder of Fig. 25, 

Fig. 30 is a graph schematically showing an example of the contents of the program chain information according 

to the present invention, 

25 Fig. 31 is a graph schematically showing another example of the contents of the program chain information ac- 

cording to the present invention, 

Fig. 32 is a graph schematically showing a relationship between the accumulated data quantity in the stream buffer 
and the operating mode of the reading head of Fig. 26, 

Fig. 33 is a graph in assistance of explaining a concept of sharing data between plural titles, 
30 Fig. 34 is a graph in assistance of explaining a concept of contiguous reproduction of non-contiguously arranged 

data block, 

Fig. 35 is a graph in assistance of explaining a concept of interleaving process enabling seamless reproduction 
of scenes, 

Fig. 36 is a graph also in assistance of explaining a concept of interleaving process enabling seamless reproduction 
35 of scenes, 

Fig. 37 is a graph schematically showing an actual arrangement of data blocks recorded to a data recording track 
on a recording medium according to the present invention, 

Fig. 38 is a graph schematically showing contiguous block regions and interleaved block regions array, 
Fig. 39 is a graph schematically showing a content of a VTS title VOBS according to the present invention, 
40 Fig. 40 is a graph schematically showing an internal data structure of the interleaved block regions according to 

the present invention, 

Fig. 41 is a graph in assistance of explaining a concept of defining the minimum number of divisions in VOB for 
interleaving, 

Fig. 42 is a graph schematically showing the data structure of the interleaved unit, 
45 Fig. 43 is a graph in assistance of explaining a concept of parental control according to the present invention, 

Fig. 44 is a graph in assistance of explaining a concept of multi-angle scene control according to the present in 
invention, 

Fig. 45 is a graph in assistance of explaining the multi-angle scene control when two multi-angle data having 
different audio data, 

50 Fig. 46 is a graph schematically showing plural angle data written to multi-angle scene period according to the 

present invention, 

Fig. 47 is a graph showing the stream containing a multi-angle scene period and the basics of the on-disk layout 
according to the present invention, 

Fig. 48 is a graph showing an example in which the last pack address in the current A-ILVU and the address of 
55 the next A-ILVU are recorded for plural angles according to the present invention, 

Figs. 49 and 50 are graphs in assistance of explaining the method of managing addresses on the switching unit 

level between multiple angle data in multi-angle scene periods according to the present invention, 

Fig. 51 is a flow chart, formed by Figs. 51 A and 51 B, showing an operation of the DVD encoder of Fig. 25, 
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Fig. 52 is a flow chart showing detailed of the encode parameter production sub-routine of Fig. 51 , 
Fig. 53 is a flow chart showing the detailed of the VOB data setting routine of Fig. 52, 
Fig. 54 is a flow chart showing the encode parameters generating operation for a seamless switching, 
Fig. 55 is a flow chart showing the encode parameters generating operation for a system stream, 

5 Fig. 56 is a flow chart showing the operation of the DVD encoder of Fig. 26, 

Fig. 57 is a flow chart showing details of the multi-angle non-seamless switching control routine of Fig. 56, 
Fig. 58 is a flow chart showing details of the multi-angle seamless switching control routine of Fig. 56, 
Fig. 59 is a flow chart showing details of the parental lock sub-routine of Fig. 56, 
Fig. 60 is a flow chart showing details of the single scene subroutine of Fig. 56, 

10 Fig. 61 is a flow chart showing the encode parameters generating operation for a system stream containing a 

single scene, 

Figs. 62 and 63 are graphs showing decoding information table produced by the decoding system controller of 
Fig. 26, 

Fig. 64 is a flow chart showing details of reproduction extracted PGC routing of Fig. 69, 
is Fig. 65 is a flow chart showing details of non-seamless multi-angle decoding process of Fig. 70, 

Fig. 66 is a block diagram showing details of the stream buffer of Fig. 26, 

Fig. 67 is a flow chart showing details of decoding data process of Fig. 64, performed by the stream buffer, is shown, 
Fig. 68 is a flow chart showing details of the decoder synchronization process of Fig. 67, 
Fig. 69 is a flow chart showing the operation of the DVD decoder DCD of Fig. 26, 

20 Fig. 70 is a flow chat showing details of the stream buffer data transfer process according to the present invention, 

Fig. 71 is a flow chart showing details of the non multi-angle decoding process of Fig. 70, 
Fig. 72 is a flow chart showing details of the non-multi-angled interleave process of Fig. 71 , 
Fig. 73 is a flow chart showing details of the non-multi-angled contiguous block process, 
Fig. 74 is a flow chart showing details of the non-multi-angle decoding process of Fig. 70, 

25 Fig. 75 is a flow chart showing details of the seamless multi-angle decoding process of Fig. 70, 

Figs. 76 and 77 are graphs in assistance of explaining the switching between scene angles within a multi-angle 
scene period, 

Fig. 78 is a graph in assistance of explaining a method for achieving the data structure shown in Fig. 46, 
Fig. 79 is a graph in assistance of explaining a method for actually setting the audio data presentation times on 
30 the smallest angle switching unit level to the same value in different angles, and 

Fig. 80 is a graph in assistance of explaining the multi-angle scene period data structure in which common audio 
data is written to each different angle according to the present invention. 



35 



Best Mode for Carrying Out the Invention 

[0022] The prevent invention is detailedly described with reference to the accompanying drawings. 



Data structure of the authoring system 



40 [0023] The logic structure of the multimedia data bitstream processed using the recording apparatus, recording me- 
dium, reproduction apparatus, and authoring system according to the [present invention is described first below with 
reference to Fig. 1 . 

[0024] In this structure, one title refers to the combination of video and audio data expressing program content rec- 
ognized by a user for education, entertainment, or other purpose. Referenced to a motion picture (movie), one title 
^5 may correspond to the content of an entire movie, or to just one scene within said movie. 

[0025] A video title set (VTS) comprises the bitstream data containing the information for a specific number of titles. 
More specifically, each VTS comprises the video, audio, and other reproduction data representing the content of each 
title in the set, and control data for controlling the content data. 

[0026] The video zone VZ is the video data unit processed by the authoring system, and comprises a specific number 
so of video title sets. More specifically, each video zone is a linear sequence of K + 1 video title sets numbered VTS #0 
- VTS #K where K is an integer value of zero or greater. One video title set, preferably the first video title set VTS #0, 
is used as the video manager describing the content information of the titles contained in each video title set. 
[0027] The multimedia bitstream MBS is the largest control unit of the multimedia data bitstream handled by the 
authoring system of the present invention, and comprises plural video zones VZ. 

55 

Authoring encoder EC 



[0028] A preferred embodiment of the authoring encoder EC according to the present invention for generating a new 
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multimedia bitstream MBS by re-encoding the original multimedia bitstream MBS according to the scenario desired by 
the user is shown in Fig. 2. Note that the original multimedia bitstream MBS comprises a video stream St1 containing 
the video information, a sub-picture stream St3 containing caption text and other auxiliary video information, and the 
audio stream St5 containing the audio information. 
5 [0029] The video and audio streams are the bitstreams containing the video and audio information obtained from 
the source within a particular period of time. The sub-picture stream is a bitstream containing momentary video infor- 
mation relevant to a particular scene. The sub-picture data encoded to a single scene may be captured to video memory 
and displayed continuously from the video memory for plural scenes as may be necessary. 

[0030] When this multimedia source data St1 , St3, and St5 is obtained from a live broadcast, the video and audio 
10 signals are supplied in real-time from a video camera or other imaging source; when the multimedia.source data is 
reproduced from a video tape or other recording medium, the audio and video signals are not real-time signals. 
[0031] While the multimedia source stream is shown in Fig. 2 as comprising these three source signals, this is for 
convenience only, and it should be noted that the multimedia source stream may contain more than three types of 
source signals, and may contain source data for different titles. Multimedia source data with audio, video, and sub- 
's picture data for plural titles are referred to below as multi-title streams. 

[0032] As shown in Fig. 2, the authoring encoder EC comprises a scenario editor 100, encoding system controller 
200, video encoder 300, video stream buffer 400, sub-picture encoder 500, sub-picture stream buffer 600, audio en- 
coder 700, audio stream buffer 800, system encoder 900, video zone formatter 1300, recorder 1200, and recording 
medium M. 

20 [0033] The video zone formatter 1300 comprises video object (VOB) buffer 1000, formatter 1100, and volume and 
file structure formatter 1400. 

[0034] The bitstream encoded by the authoring encoder EC of the present embodiment is recorded by way of example 
only to an optical disk. 

[0035] The scenario editor 1 00 of the authoring encoder EC outputs the scenario data, i.e., the user-defined editing 
25 instructions. The scenario data controls editing the corresponding parts of the multimedia bitstream MBS according to 
the user's manipulation of the video, sub-picture, and audio components of the original multimedia title. This scenario 
editor 100 preferably comprises a display, speaker(s), keyboard, CPU, and source stream buffer. The scenario editor 
1 00 is connected to an external multimedia bitstream source from which the multimedia source data St1 , St3, and St5 
are supplied. 

30 [0036] The user is thus able to reproduce the video and audio components of the multimedia source data using the 
display and speaker to confirm the content of the generated title. The user is then able to edit the title content according 
to the desired scenario using the keyboard, mouse, and other command input devices while confirming the content of 
the title on the display and speakers. The result of this multimedia data manipulation is the scenario data St7. 
[0037] The scenario data St7 is basically a set of instructions describing what source data is selected from ail or a 

35 subset of the source data containing plural titles within a defined time period, and how the selected source data is 
reassembled to reproduce the scenario (sequence) intended by the user. Based on the instructions received through 
the keyboard or other control device, the CPU codes the position, length, and the relative time-based positions of the 
edited parts of the respective multimedia source data streams St1 , St3, and St5 to generate the scenario data St7. 
[0038] The source stream buffer has a specific capacity, and is used to delay the multimedia source data streams 

40 st1 , St3, and St5 a known time Td and then output streams St1 , St3, and St5. 

[0039] This delay is required for synchronization with the editor encoding process. More specifically, when data 
encoding and user generation of scenario data St7 are executed simultaneously, i.e., when encoding immediately 
follows editing, time Td is required to determine the content of the multimedia source data editing process based on 
the scenario data St7 as will be described further below. As a result, the multimedia source data must be delayed by 

45 time Td to synchronize the editing process during the actual encoding operation. Because this delay time Td is limited 
to the time required to synchronize the operation of the various system components in the case of sequential editing 
as described above, the source stream buffer is normally achieved by means of a high speed storage medium such 
as semiconductor memory. 

[0040] During batch editing in which all multimedia source data is encoded at once ("batch encoded") after scenario 
50 data St7 is generated for the complete title, delay time Td must be long enough to process the complete title or longer. 
In this case, the source stream buffer may be a low speed, high capacity storage medium such as video tape, magnetic 
disk, or optical disk. 

[0041] The structure (type) of media used for the source stream buffer may therefore be determined according to 
the delay time Td required and the allowable manufacturing cost. 
55 [0042] The encoding system controller 200 is connected to the scenario editor 1 00 and receives the scenario data 
St7 therefrom. Based on the time-base position and length information of the edit segment contained in the scenario 
data St7, the encoding system controller 200 generates the encoding parameter signals St9, St11, and St13 for en- 
coding the edit segment of the multimedia source data. The encoding signals St9, St1 1 , and St 1 3 supply the parameters 
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used for video, sub-picture, and audio encoding, including the encoding start and end timing. Note that multimedia 
source data St1 , St3, and St5 are output after delay time Td by the source stream buffer, and are therefore synchronized 
to encoding parameter signals St9, St11, and St13. 

[0043] More specifically, encoding parameter signal St9 is the video encoding signal specifying the encoding timing 
5 of video stream St1 to extract the encoding segment from the video stream St1 and generate the video encoding unit. 
Encoding parameter signal St11 is likewise the sub-picture stream encoding signal used to generate the sub-picture 
encoding unit by specifying the encoding timing for sub-picture stream St3. Encoding parameter signal St13 is the 
audio encoding signal used to generate the audio encoding unit by specifying the encoding timing for audio stream St5. 
[0044] Based on the time-base relationship between the encoding segments of streams St1, St3, and St5 in the 
10 multimedia source data contained in scenario data St7, the encoding system controller 200 generates the timing signals 
St21, St23, and St25 arranging the encoded multimedia-encoded stream in the specified time-base relationship. 
[0045] The encoding system controller 200 also generates the reproduction time information IT defining the repro- 
duction time of the title editing unit (video object, VOB), and the stream encoding data St33 defining the system encode 
parameters for multiplexing the encoded multimedia stream containing video, audio, and sub-picture data. Note that 
15 the reproduction time information IT and stream encoding data St33 are generated for the video object VOB of each 
title in one video zone VZ. 

[0046] The encoding system controller 200 also generates the title sequence control signal St39, which declares the 
formatting parameters for formatting the title editing units VOB of each of the streams in a particular time-base rela- 
tionship as a multimedia bitstream. More specifically, the title sequence control signal St39 is used to control the con- 

20 nections between the title editing units (VOB) of each title in the multimedia bitstream MBS, or to control the sequence 
of the interleaved title editing unit (VOBs) interleaving the title editing units VOB of plural reproduction paths. 
[0047] The video encoder 300 is connected to the source stream buffer of the scenario editor 1 00 and to the encoding 
system controller 200, and receives therefrom the video stream St1 and video encoding parameter signal St9, respec- 
tively. Encoding parameters supplied by the video encoding signal St9 include the encoding start and end timing, bit 

25 rate, the encoding conditions for the encoding start and end, and the material type. Possible material types include 
NTSC or PAL video signal, and telecine converted material. Based on the video encoding parameter signal St9, the 
video encoder 300 encodes a specific part of the video stream St1 to generate the encoded video stream St15. 
[0048] The sub-picture encoder 500 is similarly connected to the source stream buffer of the scenario editor 1 00 and 
to the encoding system controller 200, and receives therefrom the sub-picture stream St3 and sub-picture encoding 

30 parameter signal St1 1 , respectively. Based on the sub-picture encoding parameter signal St1 1 , the sub-picture encoder 
500 encodes a specific part of the sub-picture stream St3 to generate the encoded sub-picture stream St1 7. 
[0049] The audio encoder 700 is also connected to the source stream buffer of the scenario editor 100 and to the 
encoding system controller 200, and receives therefrom the audio stream St5 and audio encoding parameter signal 
St13, which supplies the encoding start and end timing. Based on the audio encoding parameter signal St13, the audio 

35 encoder 700 encodes a specific part of the audio stream St5 to generate the encoded audio stream St19. 

[0050] The video stream buffer 400 is connected to the video encoder 300 and to the encoding system controller 
200. The video stream buffer 400 stores the encoded video stream St1 5 input from the video encoder 300, and outputs 
the stored encoded video stream St15 as the time-delayed encoded video stream St27 based on the timing signal 
St21 supplied from the encoding system controller 200. 

40 [0051] The sub-picture stream buffer 600 is similarly connected to the sub-picture encoder 500 and to the encoding 
system controller 200. The sub-picture stream buffer 600 stores the encoded sub-picture stream St1 7 output from the 
sub-picture encoder 500, and then outputs the stored encoded sub-picture stream St1 7 as time-delayed encoded sub- 
picture stream St29 based on the timing signal St23 supplied from the encoding system controller 200. 
[0052] The audio stream buffer 800 is similarly connected to the audio encoder 700 and to the encoding system 

45 controller 200. The audio stream buffer 800 stores the encoded audio stream Stl9 input from the audio encoder 700, 
and then outputs the encoded audio stream St1 9 as the time-delayed encoded audio stream St31 based on the timing 
signal St25 supplied from the encoding system controller 200. 

[0053] The system encoder 900 is connected to the video stream buffer 400, sub-picture stream buffer 600, audio 
stream buffer 800, and the encoding system controller 200, and is respectively supplied thereby with the time-delayed 
so encoded video stream St27, time-delayed encoded sub-picture stream St29, time-delayed encoded audio stream St31 , 
and the stream encoding data St33. Note that the system encoder 900 is a multiplexer that multiplexes the time-delayed 
streams St27, St29, and St31 based on the stream encoding data St33 (timing signal) to generate title editing unit 
(VOB) St35. The stream encoding data St33 contains the system encoding parameters, including the encoding start 
and end timing. 

55 [0054] The video zone formatter 1300 is connected to the system encoder 900 and the encoding system controller 
200 from which the title editing unit (VOB) St35 and title sequence control signal St39 (timing signal) are respectively 
supplied. The title sequence control signal St39 contains the formatting start and end timing, and the formatting pa- 
rameters used to generate (format) a multimedia bitstream MBS. The video zone formatter 1300 rearranges the title 
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editing units (VOB) St35 in one video zone VZ in the scenario sequence defined by the user based on the title sequence 
control signal St39 to generate the edited multimedia stream data St43. 

[0055] The multimedia bitstream MBS St43 edited according to the user-defined scenario is then sent to the recorder 
1200. The recorder 1200 processes the edited multimedia stream data St43 to the data stream St45 format of the 

5 recording medium M, and thus records the formatted data stream St45 to the recording medium M. Note that the 
multimedia bitstream MBS recorded to the recording medium M contains the volume file structure VFS, which includes 
the physical address of the data on the recording medium generated by the video zone formatter 1300. 
[0056] Note that the encoded multimedia bitstream MBS St35 may be output directly to the decoder to immediately 
reproduce the edited title content. It will be obvious that the output multimedia bitstream MBS will not in this case 

10 contain the volume file structure VFS. 

Authoring decoder DC 

[0057] A preferred embodiment of the authoring decoder DC used to decode the multimedia bitstream MBS edited 
15 by the authoring encoder EC of the present invention, and thereby reproduce the content of each title unit according 
to the user-defined scenario, is described next below with reference to Fig. 3. Note that in the preferred embodiment 
described below the multimedia bitstream St45 encoded by the authoring encoder EC is recorded to the recording 
medium M. 

[0058] As shown in Fig. 3, the authoring decoder DC comprises a multimedia bitstream producer 2000, scenario 
20 selector 2100, decoding system controller 2300, stream buffer 2400, system decoder 2500, video buffer 2600, sub- 
picture buffer 2700, audio buffer 2800, synchronizer 2900, video decoder 3800, sub-picture decoder 31 00, audio de- 
coder 3200, synthesizer 3500, video data output terminal 3600, and audio data output terminal 3700. 
[0059] The bitstream producer 2000 comprises a recording media drive unit 2004 for driving the recording medium 
M; a reading head 2006 for reading the information recorded to the recording medium M and producing the binary read 
25 signal St57; a signal processor 2008 for variously processing the read signal St57 to generate the reproduced bitstream 
St61; and a reproduction controller 2002. 

[0060] The reproduction controller 2002 is connected to the decoding system controller 2300 from which the multi- 
media bitstream reproduction control signal St53 is supplied, and in turn generates the reproduction control signals 
St55 and St59 respectively controlling the recording media drive unit (motor) 2004 and signal processor 2008. 
30 [0061] So that the user-defined video, sub-picture, and audio portions of the multimedia title edited by the authoring 
encoder EC are reproduced, the authoring decoder DC comprises a scenario selector 2100 for selecting and repro- 
ducing the corresponding scenes (titles). The scenario selector 2100 then outputs the selected titles as scenario data 
to the authoring decoder DC. 

[0062] The scenario selector 2100 preferably comprises a keyboard, CPU, and monitor. Using the keyboard, the 
35 user then inputs the desired scenario based on the content of the scenario input by the authoring encoder EC. Based 
on the keyboard input, the CPU generates the scenario selection data St51 specifying the selected scenario. The 
scenario selector 21 00 is connected by an infrared communications device, for example, to the decoding system con- 
troller 2300, to which it inputs the scenario selection data St51 . 

[0063] Based on the scenario selection data St51 , the decoding system controller 2300 then generates the bitstream 
40 reproduction control signal St53 controlling the operation of the bitstream producer 2000. 

[0064] The stream buffer 2400 has a specific buffer capacity used to temporarily store the reproduced bitstream St61 
input from the bitstream producer 2000, extract the address information and initial synchronization data SCR (system 
clock reference) for each stream, and generate bitstream control data St63. The stream buffer 2400 is also connected 
to the decoding system controller 2300, to which it supplies the generated bitstream control data St63. 
45 [0065] The synchronizer 2900 is connected to the decoding system controller 2300 from which it receives the system 
clock reference SCR contained in the synchronization control data St8 1 to set the internal system clock STC and supply 
the reset system clock St79 to the decoding system controller 2300. 

[0066] Based on this system clock St79, the decoding system controller 2300 also generates the stream read signal 
St65 at a specific interval and outputs the read signal St65 to the stream buffer 2400. 
so [0067] Based on the supplied read signal St65, the stream buffer 2400 outputs the reproduced bitstream St61 at a 
specific interval to the system decoder 2500 as bitstream St67. 

[0068] Based on the scenario selection data St51, the decoding system controller 2300 generates the decoding 
signal St69 defining the stream Ids for the video, sub-picture, and audio bitstreams corresponding to the selected 
scenario, and outputs to the system decoder 2500. 
55 [0069] Based on the instructions contained in th e decoding signal St69, the system decoder 2500 respectively outputs 
the video, sub-picture, and audio bitstreams input from the stream buffer 2400 to the video buffer 2600, sub-picture 
buffer 2700, and audio buffer 2800 as the encoded video stream St71 , encoded sub-picture stream St73, and encoded 
audio stream St75. 
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[0070] The system decoder 2500 detects the presentation time stamp PTS and decoding time stamp DTS of the 
smallest control unit in each bitstream St67 to generate the time information signal St77. This time information signal 
St77 is supplied to the synchronizer 2900 through the decoding system controller 2300 as the synchronization control 
data St81. 

5 [0071] Based on this synchronization control data St81 , the synchronizer 2900 determines the decoding start timing 
whereby each of the bitstreams will be arranged in the correct sequence after decoding, and then generates and inputs 
the video stream decoding start signal St89 to the video decoder 3800 based on this decoding timing. The synchronizer 
2900 also generates and supplies the sub-picture decoding start signal St91 and audio stream decoding start signal 
St93 to the sub-picture decoder 31 00 and audio decoder 3200, respectively. 

10 [0072] The video decoder 3800 generates the video output request signal St84 based on the video stream decoding 
start signal St89, and outputs to the video buffer 2600. In response to the video output request signal St84, the video 
buffer 2600 outputs the video stream St83 to the video decoder 3800. The video decoder 3800 thus detects the pres- 
entation time information contained in the video stream St83, and disables the video output request signal St84 when 
the length of the received video stream St83 is equivalent to the specified presentation time. A video stream equal in 

15 length to the specified presentation time is thus decoded by the video decoder 3800, which outputs the reproduced 
video signal St1 04 to the synthesizer 3500, 

[0073] The sub-picture decoder 3100 similarly generates the sub-picture output request signal St86 based on the 
sub-picture decoding start signal St91 , and outputs to the sub-picture buffer 2700. In response to the sub-picture output 
request signal St86, the sub-picture buffer 2700 outputs the sub-picture stream St85 to the sub-picture decoder 31 00. 
20 Based on the presentation time information contained in the sub-picture stream St85, the sub-picture decoder 3100 
decodes a length of the sub-picture stream St85 corresponding to the specified presentation time to reproduce and 
supply to the synthesizer 3500 the sub-picture signal St99, 

[0074] The synthesizer 3500 superimposes the video signal St1 04 and sub-picture signal St99 to generate and output 
the multi-picture video signal St105 to the video data output terminal 3600. 

25 [0075] The audio decoder 3200 generates and supplies to the audio buffer 2800 the audio output request signal 
St88 based on the audio stream decoding start signal St93. The audio buffer 2800 thus outputs the audio stream St87 
to the audio decoder 3200. The audio decoder 3200 decodes a length of the audio stream St87 corresponding to the 
specified presentation time based on the presentation time information contained in the audio stream St87, and outputs 
the decoded audio stream St101 to the audio data output terminal 3700. 

30 [0076] It is thus possible to reproduce a user-defined multimedia bitstream MBS in real-time according to a user- 
defined scenario. More specifically, each time the user selects a different scenario, the authoring decoder DC is able 
to reproduce the title content desired by the user in the desired sequence by reproducing the multimedia bitstream 
MBS corresponding to the selected scenario. 

[0077] It is therefore possible by means of the authoring system of the present invention to generate a multimedia 
35 bitstream according to plural user-defined scenarios by real-time or batch encoding multimedia source data in a manner 
whereby the substreams of the smallest editing units (scenes), which can be divided into plural substreams, expressing 
the basic title content are arranged in a specific time-base relationship. 

[0078] The multimedia bitstream thus encoded can then be reproduced according to the one scenario selected from 
among plural possible scenarios. It is also possible to change scenarios while playback is in progress, i.e., to select a 
40 different scenario and dynamically generate a new multimedia bitstream according to the most recently selected sce- 
nario. It is also possible to dynamically select and reproduce any of plural scenes while reproducing the title content 
according to a desired scenario. 

[0079] It is therefore possible by means of the authoring system of the present invention to encode and not only 
reproduce but to repeatedly reproduce a multimedia bitstream MBS in real-time. 
45 [0080] A detail of the authoring system is disclosed Japanese Patent Application filed September 27, 1996, and 
entitled and assigned to the same assignee as the present application. 

DVD 



so [0081] An example of a digital video disk (DVD) with only one recording surface (a single-sided DVD) is shown in 
Fig. 4. 

[0082] The DVD recording medium RC1 in the preferred embodiment of the invention comprises a data recording 
surface RS1 to and from which data is written and read by emitting laser beam LS, and a protective layer PL1 covering 
the data recording surface RS1 . A backing layer BL1 is also provided on the back of data recording surface RS1 . The 
55 side of the disk on which protective layer PL1 is provided is therefore referred to below as side SA (commonly "side 
A"), and the opposite side (on which the backing layer BL1 is provided) is referred to as side SB ("side B"). Note that 
digital video disk recording media having a single data recording surface RS1 on only one side such as this DVD 
recording medium RC1 is commonly called a single-sided single layer disk. 
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[0083] A detailed illustration of area C1 in Fig. 4 is shown in Fig. 5. Note that the data recording surface RS1 is 
formed by applying a metallic thin film or other reflective coating as a data layer 41 09 on a first transparent layer 41 08 
having a particular thickness T1 . This first transparent layer 41 08 also functions as the protective layer PL1 . A second 
transparent substrate 4111 of a thickness T2 functions as the backing layer BL1 , and is bonded to the first transparent 

5 layer 41 08 by means of an adhesive layer 41 10 disposed therebetween. 

[0084] A printing layer 41 1 2 for printing a disk label may also be disposed on the second transparent substrate 41 1 1 
as necessary. The printing layer 41 1 2 does not usually cover the entire surface area of the second transparent substrate 
41 11 (backing layer BL1 ), but only the area needed to print the text and graphics of the disk label. The area of second 
transparent substrate 41 1 1 to which the printing layer 41 1 2 is not formed may be left exposed. Light reflected from the 

10 data layer 41 09 (metallic thin film) forming the data recording surface RS1 can therefore be directly..observed where 
the label is not printed when the digital video disk is viewed from side SB. As a result, the background looks like a 
silver-white over which the printed text and graphics float when the metallic thin film is an aluminum thin film, for 
example. 

[0085] Note that it is only necessary to provide the printing layer 41 1 2 where needed for printing, and it is not nec- 
*5 essary to provide the printing layer 4112 over the entire surface of the backing layer BL1 . 

[0086] A detailed illustration of area C2 in Fig. 5 is shown in Fig. 6. Pits and lands are molded to the common contact 
surface between the first transparent layer 41 08 and the data layer 41 09 on side SA from which data is read by emitting 
a laser beam LS, and data is recorded by varying the lengths of the pits and lands (i.e., the length of the intervals 
between the pits). More specifically, the pit and land configuration formed on the first transparent layer 41 08 is trans- 
it? ferred to the data layer 41 09. The lengths of the pits and lands are shorter, and the pitch of the data tracks formed by 
the pit sequences is narrower, than with a conventional Compact Disc (CD). The surface recording density is therefore 
greatly improved. 

[0087] Side SA of the first transparent layer 41 08 on which data pits are not formed is a flat surface. The second 
transparent substrate 4111 is for reinforcement, and is a transparent panel made from the same material as the first 
25 transparent layer 4108 with both sides flat. Thickness T1 and T2 are preferably equal and commonly approximately 
0.6 mm, but the invention shall not be so limited. 

[0088] As with a CD, information is read by irradiating the surface with a laser beam LS and detecting the change 
in the reflectivity of the light spot. Because the objective lens aperture NA can be large and the wavelength I of the 
light beam small in a digital video disk system, the diameter of the light spot Ls used can be reduced to approximately 
30 1/1 .6 the light spot needed to read a CD. Note that this means the resolution of the laser beam LS in the DVD system 
is approximately 1 .6 times the resolution of a conventional CD system. 

[0089] The optical system used to read data from the digital video disk uses a short 650 nm wavelength red semi- 
conductor laser and an objective lens with a 0.6 mm aperture NA. By thus also reducing the thickness T of the trans- 
parent panels to 0.6 mm, more than 5 GB of data can be stored to one side of a 120 mm diameter optical disk. 

35 [0090] It is therefore possible to store motion picture (video) images having an extremely large per unit data size to 
a digital video disk system disk without losing image quality because the storage capacity of a single-sided, single- 
layer recording medium RC1 with one data recording surface RS1 as thus described is nearly ten times the storage 
capacity of a conventional CD. As a result, while the video presentation time of a conventional CD system is approx- 
imately 74 minutes if image quality is sacrificed, high quality video images with a video presentation time exceeding 

40 two hours can be recorded to a DVD. 

[0091] The digital video disk is therefore well-suited as a recording medium for video images. 
[0092] A digital video disk recording medium with plural recording surfaces RS as described above is shown in Figs. 
7 and 8. The DVD recording medium RC2 shown in Fig. 7 comprises two recording surfaces, i.e., first recording surface 
RS1 and semi-transparent second recording surface RS2, on the same side, i.e. side SA, of the disk. Data can be 

45 simultaneously recorded or reproduced from these two recording surfaces by using different laser beams LS1 and LS2 
for the first recording surface RS1 and the second recording surface RS2. It is also possible to read/write both recording 
surfaces RS1 and RS2 using only one of the laser beams LS1 or LS2. Note that recording media thus comprised are 
called "single-side, dual-layer disks." 

[0093] It should also be noted that while two recording surfaces RS1 and RS2 are provided in this example, it is also 
50 possible to produce digital video disk recording media having more than two recording surfaces RS. Disks thus com- 
prised are known as "single-sided, multi-layer disks." 

[0094] Though comprising two recording surfaces similarly to the recording media shown in Fig. 7, the DVD recording 
medium RC3 shown in Fig. 8 has the recording surfaces on opposite sides of the disk, i. e., has the first data recording 
surface RS1 on side SA and the second data recording surface RS2 on side SB. It will also be obvious that while only 
55 two recording surfaces are shown on one digital video disk in this example, more than two recording surfaces may 
also be formed on a double-sided digital video disk. As with the recording medium shown in Fig. 7, it is also possible 
to provide two separate laser beams LS1 and LS2 for recording surfaces RS1 and RS2, or to read/write both recording 
surfaces RS1 and RS2 using a single laser beam. Note that this type of digital video disk is called a "double-sided, 
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dual-layer disk." It will also be obvious that a double-sided digital video diskcan be comprised with two or more recording 
surfaces per side. This type of disk is called a "double-sided, multi-layer disk." 

[0095] A plan view from the laser beam LS irradiation side of the recording surface RS of the DVD recording medium 
RC is shown in Fig. 9 and Fig. 10. Note that a continuous spiral data recording track TR is provided from the inside 
5 circumference to the outside circumference of the DVD. The data recording track TR is divided into plural sectors each 
having the same known storage capacity. Note that for simplicity only the data recording track TR is shown in Fig. 9 
with more than three sectors per revolution. 

[0096] As shown in Fig. 9, the data recording track TR is normally formed clockwise inside to outside (see arrow 
DrA) from the inside end point IA at the inside circumference of disk RCA to the outside end point OA at the outside 
10 circumference of the disk with the disk RCA rotating counterclockwise RdA.This type of disk RCA is called a clockwise 
disk, and the recording track formed thereon is called a clockwise track TRA. 

[0097] Depending upon the application, the recording track TRB may be formed clockwise from outside to inside 
circumference (see arrow DrB in Fig. 10) from the outside end point OB at the outside circumference of disk RCB to 
the inside end point IB at the inside circumference of the disk with the disk RCB rotating clockwise RdB. Because the 

15 recording track appears to wind counterclockwise when viewed from the inside circumference to the outside circum- 
ference on disks with the recording track formed in the direction of arrow DrB, these disks are referred to as counter- 
clockwise disk RCB with counterclockwise track TRB to distinguish them from disk RCA in Fig. 9. Note that track 
directions DrA and DrB are the track paths along which the laser beam travels when scanning the tracks for recording 
and playback. Direction of disk rotation RdA in which disk RCA turns is thus opposite the direction of track path DrA, 

20 and direction of disk rotation RdB in which disk RCB turns is thus opposite the direction of track path DrB. 

[0098] An exploded view of the single-sided, dual-layer disk RC2 shown in Fig. 7 is shown as disk RC2o in Fig. 11 . 
Note that the recording tracks formed on the two recording surfaces run in opposite directions. Specifically, a clockwise 
recording track TRA as shown in Fig. 9 is formed in clockwise direction DrA on the (lower) first data recording surface 
RS1, and a counterclockwise recording track TRB formed in counterclockwise direction DrB as shown in Fig. 10 is 

25 provided on the (upper) second data recording surface RS2. As a result, the outside end points OA and OB of the first 
and second (top and bottom) tracks are at the same radial position relative to the center axis of the disk RC2o. Note 
that track paths DrA and DrB of tracks TR are also the data read/write directions to disk RC. The first and second (top 
and bottom) recording tracks thus wind opposite each other with this disk RC, i.e., the track paths DrA and DrB of the 
top and bottom recording layers are opposite track paths. 

30 [0099] Opposite track path type, single-sided, dual-layer disks RC2o rotate in direction RdA corresponding to the 
first recording surface RS1 with the laser beam LS traveling along track path DrA to trace the recording track on the 
first recording surface RS1 . When the laser beam LS reaches the outside end point OA, the laser beam LS can be 
refocused to end point OB on the second recording surface RS2 to continue tracing the recording track from the first 
to the second recording surface uninterrupted. The physical distance between the recording tracks TRA and TRB on 

35 the first and second recording surfaces RS1 and RS2 can thus be instantaneously eliminated by simply adjusting the 
focus of the laser beam LS. 

[0100] It is therefore possible with an opposite track path type, single-sided, dual-layer disk RC2o to easily process 
the recording tracks disposed to physically discrete top and bottom recording surfaces as a single contin uous recording 
track. It is therefore also possible in an authoring system as described above with reference to Fig. 1 to continuously 
40 record the multimedia bitstream MBS that is the largest multimedia data management unit to two discrete recording 
surfaces RS1 and RS2 on a single recording medium RC2o. 

[01 01 ] It should be noted that the tracks on recording surfaces RS1 and RS2 can be wound in the directions opposite 
those described above, i.e., the counterclockwise track TRB may be provided on the first recording surface RS1 and 
the clockwise track TRA on the second recording surface RS2. In this case the direction of disk rotation is also changed 
45 to a clockwise rotation RdB, thereby enabling the two recording surfaces to be used as comprising a single continuous 
recording track as described above. For simplification, a further example of this type of disk is therefore neither shown 
nor described below. 

[0102] It is therefore possible by thus constructing the digital video disk to record the multimedia bitstream MBS for 
a feature-length title to a single opposite track path type, single-sided, dual-layer disk RC2o. Note that this type of 

50 digital video disk medium is called a single-sided dual-layer disk with opposite track paths. 

[0103] Another example of the single-sided, dual-layer DVD recording medium RC2 shown in Fig. 7 is shown as disk 
RC2p in Fig. 12. The recording tracks formed on both first and second recording surfaces RS1 and RS2 are clockwise 
tracks TRA as shown in Fig. 9. In this case, the single-sided, dual-layer disk RC2p rotates counterclockwise in the 
direction of arrow RdA, and the direction of laser beam LS travel is the same as the direction of the track spiral, i.e., 

55 the track paths of the top and bottom recording surfaces are mutually parallel (parallel track paths). The outside end 
points OA of both top and bottom tracks are again preferably positioned at the same radial position relative to the 
center axis of the disk RC2p as described above. As also described above with disk RC2o shown in Fig. 1 1 , the access 
point can be instantaneously shifted from outside end point OA of track TRA on the first recording surface RS1 to the 
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outside end point OA of track TRA on the second recording surface RS2 by appropriately adjusting the focus of the 
laser beam LS at outside end point OA. 

[0104] However, for the laser beam LS to continuously access the clockwise recording track TRA on the second 
recording surface RS2, the recording medium RC2p must be driven in the opposite direction (clockwise, opposite 
5 direction RdA). Depending on the radial position of the laser beam LS, however, it is inefficient to change the rotational 
direction of the recording medium. As shown by the diagonal arrow in Fig. 12, the laser beam LS is therefore moved 
from the outside end point OA of the track on the first recording surface RS1 to the inside end point IA of the track on 
the second recording surface RS2 to use these physically discrete recording tracks as one logically continuous record- 
ing track. 

10 [0105] Rather than using the recording tracks on , top and bottom recording surfaces as one continuous recording 
track, it is also possible to use the recording tracks to record the multimedia bitstreams MBS for different titles. This 
type of digital video disk recording medium is called a "single-sided, dual-layer disk with parallel track paths." 
[0106] Note that if the direction of the tracks formed on the recording surfaces RS1 and RS2 is opposite that described 
above, i.e., counterclockwise recording tracks TRB are formed, disk operation remains the same as that described 

15 above except for the direction of disk rotation, which is clockwise as shown by arrow RdB. 

[0107] Whether using clockwise or counterclockwise recording tracks, the single-sided, dual-layer disk RC2p with 
parallel track paths thus described is well-suited to storing on a single disk encyclopedia and similar multimedia bit- 
streams comprising multiple titles that are frequently and randomly accessed. 

[0108] An exploded view of the dual-sided single-layer DVD recording medium RC3 comprising one recording surface 
20 layer RS1 and RS2 on each side as shown in Fig. 8 is shown as DVD recording medium RC3s in Fig. 13. Clockwise 
recording track TRA is provided on the one recording surface RS1, and a counterclockwise recording track TRB is 
provided on the other recording surface RS2. As in the preceding recording media, the outside end points OA and OB 
of the recording tracks on each recording surface are preferably positioned at the same radial position relative to the 
center axis of the DVD recording medium RC3s. 
25 [0109] Note that while the recording tracks on these recording surfaces RS1 and RS2 rotate in opposite directions, 
the track paths are symmetrical. This type of recording medium is therefore known as a double-sided dual layer disk 
with symmetrical track paths. This double-sided dual layer disk with symmetrical track paths RC3s rotates in direction 
RdA when reading/writing the first recording surface RS1 . As a result, the track path on the second recording surface 
RS2 on the opposite side is opposite the direction DrB in which the track winds, i.e., direction DrA. Accessing both 
30 recording surfaces RS1 and RS2 using a single laser beam LS is therefore not realistic irrespective of whether access 
is continuous or non-continuous. In addition, a multimedia bitstream MBS is separately recorded to the recording sur- 
faces on the first and second sides of the disk. 

[0110] A different example of the double-sided single layer disk RC3 shown in Fig. 8 is shown in Fig. 14 as disk 
RC3a. Note that this disk comprises clockwise recording tracks TRA as shown in Fig. 9 on both recording surfaces 

35 RS1 and RS2. As with the preceding recording media, the outside end points OA and OA of the recording tracks on 
each recording surface are preferably positioned at the same radial position relative to the center axis of the DVD 
recording medium RC3a. Unlike the double-sided dual layer disk with symmetrical track paths RC3s described above, 
the tracks on these recording surfaces RS1 and RS2 are asymmetrical. This type of disk is therefore known as a 
double-sided dual layer disk with asymmetrical track paths. This double-sided dual layer disk with asymmetrical track 

40 paths RC3a rotates in direction RdA when reading/writing the first recording surface RS1 . As a result, the track path 
on the second recording surface RS2 on the opposite side is opposite the direction DrA in which the track winds, i.e., 
direction DrB. 

[0111] This means that if a laser beam LS is driven continuously from the inside circumference to the outside cir- 
cumference on the first recording surface RS1 , and then from the outside circumference to the inside circumference 
45 on the second recording surface RS2, both sides of the recording medium RC3a can be read/written without turning 
the disk over and without providing different laser beams for the two sides. 

[0112] The track paths for recording surfaces RS1 and RS2 are also the same with this double-sided dual layer disk 
with asymmetrical track paths RC3a. As a result, it is also possible to read/write both sides of the disk without providing 
separate laser beams for each side if the recording medium RC3a is turned over between sides, and the read/write 

50 apparatus can therefore be constructed economically. 

[01 1 3] It should be noted that this recording medium remains functionally identical even if counterclockwise recording 
track TRB is provided in place of clockwise recording track TRA on both recording surfaces RS1 and RS2. 
[0114] As described above, the true value of a DVD system whereby the storage capacity of the recording medium 
can be easily increased by using a multiple layer recording surface is realized in multimedia applications whereby plural 

55 video data units, plural audio data units, and plural graphics data units recorded to a single disk are reproduced through 
interactive operation by the user. 

[0115] It is therefore possible to achieve one long-standing desire of software (programming) providers, specifically, 
to provide programming content such as a commercial movie on a single recording medium in plural versions for 
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different language and demographic groups while retaining the image quality of the original. 
Parental control 

5 [0116] Content providers of movie and video titles have conventionally had to produce, supply, and manage the 
inventory of individual titles in multiple languages, typically the language of each distribution market, and multi-rated 
title packages conforming to the parental control (censorship) regulations of individual countries in Europe and North 
America. The time and resources required for this are significant. While high image quality is obviously important, the 
programming content must also be consistently reproducible. 

10 [0117] The digital video disk recording medium is close to solving these problems, ,. . 

Multiple angles 

. [0118] One interactive operation widely sought in multimedia applications today is for the user to be able to change 
15 the position from which a scene is viewed during reproduction of that scene. This capability is achieved by means of 
the multiple angle function. 

[0119] This multiple angle function makes possible applications whereby, for example, a user can watch a baseball 
game from different angles (or virtual positions in the stadium), and can freely switch between the views while viewing 
is in progress. In this example of a baseball game, the available angles may include a position behind the backstop 

20 cen tered on the catcher, batter, and pitcher; one from behind the backstop centered on a fielder, the pitcher, and the 
catcher; and one from center field showing the view to the pitcher and catcher. To meet these requirements, the digital 
video disk system uses MPEG, the same basic standard format used with Video-CDs to record the video, audio, graph- 
ics, and other signal data. Because of the differences in storage capacity, transfer rates, and signal processing per- 
formance within the reproduction apparatus, DVD uses MPEG2, the compression method and data format of which 

25 differ slightly from the MPEG1 format used with Video-CDs. 

[0120] It should be noted that the content of and differences between the MPEG1 and MPEG2 standards have no 
direct relationship to the intent of the present invention, and further description is therefore omitted below (for more 
information, see MPEG specifications ISO-11172 and ISO-13818). 

[0121] The data structure of the DVD system according to the present invention is described in detail below with 
30 reference to Figs. 1 6, 1 7, 1 8, 1 9, 20, and 21 . 

Multi-scene control 

[0122] A fully functional and practical parental lock playback function and multi-angle scene playback function must 
35 enable the user to modify the system output in minor, subtle ways while still presenting substantially the same video 
and audio output. If these functions are achieved by preparing and recording separate titles satisfying each of the many 
possible parental lock and multi-angle scene playback requests, titles that are substantially identical and differ in only 
minor ways must be recorded to the recording medium. This results in identical data being repeatedly recorded to the 
larger part of the recording medium, and significantly reduces the utilization efficiency of the available storage capacity. 
40 More particularly, it is virtually impossible to record discrete titles satisfying every possible request even using the 
massive capacity of the digital video disk medium. While it may be concluded that this problem can be easily solved 
by increasing the capacity of the recording medium, this is an obviously undesirable solution when the effective use 
of available system resources is considered. 

[0123] Using multi-scene control, the concept of which is described in another section below, in a DVD system, it is 
45 possible to dynamically construct titles for numerous variations of the same basic content using the smallest possible 
amount of data, and thereby effectively utilize the available system resources (recording medium). More specifically, 
titles that can be played back with numerous variations are constructed from basic (common) scene periods containing 
data common to each title, and multi-scene periods comprising groups of different scenes corresponding to the various 
requests. During reproduction, the user is able to freely and at any time select particular scenes from the multi-scene 
so periods to dynamically construct a title conforming to the desired content, e.g., a title omitting certain scenes using the 
parental lock control function. 

[0124] Note that multi-scene control enabling a parental lock playback control function and multi-angle scene play- 
back is described in another section below with reference to Fig. 21 . 

55 Data structure of the DVD system 

[0125] The data structure used in the authoring system of a digital video disk system according to the present in- 
vention is shown in Fig. 22. To record a multimedia bitstream MBS, this digital video disk system divides the recording 



EP 0 918 438 B1 



medium into three major recording areas, the lead-in area LI, the volume space VS, and the lead-out area LO. 
[01 26] The lead-in area LI is provided at the inside circumference area of the optical disk. In the disks described with 
reference to Figs. 9 and 10, the lead-in area LI is positioned at the inside end points IA and IB of each track. Data for 
stabilizing the operation of the reproducing apparatus when reading starts is written to the lead-in area LI. 
5 [0127] The lead-out area LO is correspondingly located at the outside circumference of the optical disk, i.e., at outside 
end points OA and OB of each track in the disks described with reference to Figs. 9 and 10. Data identifying the end 
of the volume space VS is recorded in this lead-out area LO. 

[0128] The volume space VS is located between the lead-in area LI and lead-out area LO, and is recorded as a one- 
dimensional array of n+1 (where n is an integer greater than or equal to zero) 2048-byte logic sectors LS. The logic 
10 sectors LS are sequentially number #0, #1, #2, ... #n. The volume space VS is also dividedJnto a volume and file 
structure management area VFS and a file data structure area FDS. 

[0129] The volume and file structure management area VFS comprises m+1 logic sectors LS#0 to LS#m (where m 
is an integer greater than or equal to zero and less than n. The file data structure FDS comprises n-m logic sectors LS 
#m+1 to LS #n. 

15 [0130] Note that this file data structure area FDS corresponds to the multimedia bitstream MBS shown in Fig. 1 and 
described above. 

[01 31 ] The volume file structure VFS is the file system for managing the data stored to the volume space VS as files, 
and is divided into logic sectors LS#0 - LS#m where m is the number of sectors required to store all data needed to 
manage the entire disk, and is a natural number less than n. Information for the files stored to the file data structure 
20 area FDS is written to the volume file structure VFS according to a known specification such as ISO-9660 or ISO-1 3346. 
[0132] The file data structure area FDS comprises n-m logic sectors LS#m - LS#n, each comprising a video manager 
VMG sized to an integer multiple of the logic sector (2048 x I, where I is a known integer), and k video title sets VTS 
#1 - VTS#k (where k is a natural number less than 100). 

[0133] The video manager VMG stores the title management information for the entire disk, and information for 
25 building a volume menu used to set and change reproduction control of the entire volume. 

[0134] Any video title set VTS #k is also called a "video file" representing a title comprising video, audio, and/or still 
image data. 

[0135] The internal structure of each video title set VTS shown in Fig. 22 is shown in Fig. 16. Each video title set 
VTS comprises VTS information VTSI describing the management information for the entire disk, and the VTS title 
30 video objects VOB (VTSTT_VOBS), i.e., the system stream of the multimedia bitstream. The VTS information VTSI is 
described first below, followed by the VTS title VOBS. 

[0136] The VTS information primarily includes the VTSI management table VTSLMAT and VTSPGC information 
table VTS_PGCIT. 

[0137] The VTSI management table VTSI_MAT stores such information as the internal structure of the video title set 
35 VTS, the number of selectable audio streams contained in the video title set VTS, the number of sub-pictures, and the 
video title set VTS location (storage address). 

[0138] The VTSPGC information table VTS_PGCIT records i (where i is a natural number) program chain (PGC) 
data' blocks VTS_PGCI #1 - VTS_PGCI #i for controlling the playback sequence. Each of the table entries VTS_PGCI 
#i is a data entry expressing the program chain, and comprises j (where j is a natural number) cell playback information 
40 blocks C_PBI #1 - C_PBI #j. Each cell playback information block C_PBI #j contains the playback sequence of the cell 
and playback control information. 

[01 39] The program chain PGC is a conceptual structure describing the story of the title content, and therefore defines 
the structure of each title by describing the cell playback sequence. Note that these cells are described in detail below. 
[01 40] If, for example, the video title set information relates to the menus, the video title set information VTSI is stored 

45 to a buffer in the playback device when playback starts. If the user then presses a MENU button on a remote control 
device, for example, during playback, the playback device references the buffer to fetch the menu information and 
display the top menu #1 . If the menus are hierarchical, the main menu stored as program chain information VTS_PGCI 
#1 may be displayed, for example, by pressing the MENU button, VTS_PGCI #2 - #9 may correspond to submenus 
accessed using the numeric keypad on the remote control, and VTS_PGCI #1 0 and higher may correspond to additional 

50 submenus further down the hierarchy. Alternatively, VTS_PGCI #1 may be the top menu displayed by pressing the 
MENU button, while VTS_PGCI #2 and higher may be voice guidance reproduced by pressing the corresponding 
numeric key. 

[0141] The menus themselves are expressed by the plural program chains defined in this table. As a result, the 
menus may be freely constructed in various ways, and shall not be limited to hierarchical or non-hierarchical menus 
55 or menus containing voice guidance. 

[0142] In the case of a movie, for example, the video title set information VTSI is stored to a buffer in the playback 
device when playback starts, the playback device references the cell playback sequence described by the program 
chain PGC, and reproduces the system stream. 
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[0143] The "cells" referenced here may be all or part of the system stream, and are used as access points during 
playback. Cells can therefore be used, for example, as the "chapters" into which a title may be divided. 
[0144] Note that each of the PGC information entries C_PBI #j contain both cell playback processing information 
and a cell information table. The cell playback processing information comprises the processing information needed 
5 to reproduce the cell, such as the presentation time and number of repetitions. More specifically, this information in- 
cludes the cell block mode CBM, cell block type CBT, seamless playback flag SPF, interleaved allocation flag IAF, STC 
resetting flag STCDF, cell presentation time C_PBTM , seamless angle change flag SACF, first cell VOBU start address 
C.FVOBILSA, and the last cell VOBU start address C_LVOBU_SA. 

[0145] Note that seamless playback refers to the reproduction in a digital video disk system of multimedia data 
10 including video, audio, and sub-picture data without intermittent breaks in the data or information. Seamless playback 
is described in detail in another section below with reference to Fig. 23 and Fig. 24. 

[0146] The cell block mode CBM indicates whether plural cells constitute one functional block. The cell playback 
information of each cell in a functional block is arranged consecutively in the PGC information. The cell block mode 
CBM of the first cell playback information in this sequence contains the value of the first cell in the block, and the cell 
*5 block mode CBM of the last cell playback information in this sequence contains the value of the last cell in the block. 
The cell block mode CBM of each cell arrayed between these first and last cells contains a value indicating that the 
cell is a cell between these first and last cells in that block. 

[0147] The cell block type CBT identifies the type of the block indicated by the cell block mode CBM. For example, 
when a multiple angle function is enabled, the cell information corresponding to each of the reproducible angles is 
20 programmed as one of the functional blocks mentioned above, and the type of these functional blocks is defined by a 
value identifying "angle" in the cell block type CBT for each cell in that block. 

[0148] The seamless playback flag SPF simply indicates whether the corresponding cell is to be linked and played 
back seamlessly with the cell or cell block reproduced immediately therebefore. To seamlessly reproduce a given cell 
with the preceding cell or cell block, the seamless playback flag SPF is set to 1 in the cell playback information for that 

25 cell; otherwise SPF is set to 0. 

[0149] The interleaved allocation flag IAF stores a value identifying whether the cell exists in a contiguous or inter- 
leaved block. If the cell is part of an interleaved block, the flag IAF is set to 1 ; otherwise it is set to 0. 
[0150] The STC resetting flag STCDF identifies whether the system time clock STC used for synchronization must 
be reset when the cell is played back; when resetting the system time clock STC is necessary, the STC resetting flag 

30 STCDF is set to 1. 

[01 51 ] The seamless angle change flag SACF stores a value indicating whether a cell in a multi-angle period should 
be connected seamlessly at an angle change. If the angle change is seamless, the seamless angle change flag SACF 
is set to 1 ; otherwise it is set to 0. 

[0152] The cell presentation time C_PBTM expresses the cell presentation time with video frame precision. 
35 [0153] The first cell VOBU start address C_FVOBU_SA is the VOBU start address of the first cell in a block, and is 
also expressed as the distance from the logic sector of the first cell in the VTS title VOBS (VTSTT_VOBS) as measured 
by the number of sectors. 

[0154] The last cell VOBU start address C_LVOBU_SA is the VOBU start address of the last cell in the block. The 
value of this address is expressed as the distance from the logic sector of the first cell in the VTS title VOBS 

40 (VTSTT_VOBS) as measured by the number of sectors. 

[0155] The VTS title VOBS (VTSTT_VOBS), i.e., the multimedia system stream data, is described next. The system 
stream data VTSTT_VOBS comprises i (where i is a natural number) system streams SS, each of which is referred to 
as a "video object" (VOB). Each video object VOB #1 - VOB #i comprises at least one video data block interleaved 
with up to a maximum eight audio data blocks and up to a maximum 32 sub-picture data blocks. 

45 [0156] Each video object VOB comprises q (where q is a natural number) cells C#1 - C#q. Each cell C comprises r 
(where r is a natural number) video object units VOBU #1-VOBU #r. 

[0157] Each video object unit VOBU comprises plural group_of pictures GOP, and the audio and sub-pictures cor- 
responding to the playback of said plural group_of_pictures GOP. Note that the group_of pictures GOP corresponds 
to the video encoding refresh cycle. Each video object unit VOBU also starts with an NV pack, i.e., the control data for 
50 that VOBU. 

[0158] The structure of the navigation packs NV is described with reference to Fig. 18. 

[0159] Before describing the navigation pack NV, the internal structure of the video zone VZ (see Fig. 22), i.e., the 
system stream St35 encoded by the authoring encoder EC described with reference to Fig. 25, is described with 
reference to Fig. 17. Note that the encoded video stream St15 shown in Fig. 17 is the compressed one-dimensional 
55 video data stream encoded by the video encoder 300. The encoded audio stream St19 is likewise the compressed 
one-dimensional audio data stream multiplexing the right and left stereo audio channels encoded by the audio encoder 
700. Note that the audio signal shall not be limited to a stereo signal, and may also be a multichannel surround-sound 
signal. 
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[0160] The system stream (title editing unit VOB) St35 is a one dimensional array of packs with a byte size corre- 
sponding to the logic sectors LS #n having a 2048-byte capacity as described using Fig. 21 . A stream control pack is 
placed at the beginning of the title editing unit (VOB) St35, i.e., at the beginning of the video object unit VOBU. This 
stream control pack is called the "navigation pack NV", and records the data arrangement in the system stream and 

5 other control information. 

[01 61 ] The encoded video stream St1 5 and the encoded audio stream St1 9 are packetized in byte units correspond- 
ing to the system stream packs. These packets are shown in Fig. 17 as packets V1 , V2, V3, V4... and A1 , A2, A3.... 
As shown in Fig. 17, these packets are interleaved in the appropriate sequence as system stream St35, thus forming 
a packet stream, with consideration given to the decoder buffer size and the time required by the decoder to expand 

10 the video and audio data packets. In the example shown in Fig. 17, the packet stream is interleaved in the sequence 
V1, V2, A1, V3, V4, A2.... 

[01 62] Note that the sequence shown in Fig. 1 7 interleaves one video data unit with one audio data unit. Significantly 
increased recording/playback capacity, high speed recording/playback, and performance improvements in the signal 
processing LSI enable the DVD system to record plural audio data and plural sub-picture data (graphics data) to one 
15 video data unit in a single interleaved MPEG system stream, and thereby enable the user to select the specific audio 
data and sub-picture data to be reproduced during playback. The structure of the system stream used in this type of 
DVD system is shown in Fig. 18 and described below. 

[0163] As in Fig. 17, the packetized encoded video stream St15 is shown in Fig. 18 as V1, V2, V3, V4, ... In this 
example, however, there is not just one encoded audio stream St1 9, but three encoded audio streams St1 9A, St1 9B, 
20 and St1 9C input as the source data. There are also two encoded sub-picture streams St1 7A and St1 7B input as the 
source data sub-picture streams. These six compressed data streams, St15, Stl9A, St19B, St19C, Stl7A and St17B, 
are interleaved to a single system stream St35. 

[0164] The video data is encoded according to the MPEG specification with the group_of_pictures GOP being the 
unit of compression. In general, each group_of _pictures GOP contains 15 frames in the case of an NTSC signal, but 

25 the specific number of frames compressed to one GOP is variable. The stream management pack, which describes 
the management data containing, for example, the relationship between interleaved data, is also interleaved at the 
GOP unit interval. Because the group_of_pictures GOP unit is based on the video data, changing the number of video 
frames per GOP unit changes the interval of the stream management packs. This interval is expressed in terms of the 
presentation time on the digital video disk within a range from 0.4 sec. to 1 .0 sec. referenced to the GOP unit. If the 

30 presentation time of contiguous plural GOP units is less than 1 sec, the management data packs for the video data 
of the plural GOP units is interleaved to a single stream. 

[0165] These management data packs are referred to as navigation packs NV in the digital video disk system. The 
data from one navigation pack NV to the packet immediately preceding the next navigation pack NV forms one video 
object unit VOBU. In general, one contiguous playback unit that can be defined as one scene is called a video object 
35 VOB, and each video object VOB contains plural video object units VOBU. Data sets of plural video objects VOB form 
a VOB set (VOBS). Note that these data units were first used in the digital video disk. 

[01 66] When plural of these data streams are interleaved, the navigation packs N V defining the relationship between 
the interleaved packs must also be interleaved at a defined unit known as the pack number unit. Each group__of ^pictures 
GOP is normally a unit containing approximately 0. 5 sec. of video data, which is equivalent to the presentation time 
40 required for 12 - 15 frames, and one navigation pack NV is generally interleaved with the number of data packets 
required for this presentation time. 

[0167] The stream management information contained in the interleaved video, audio, and sub-picture data packets 
constituting the system stream is described below with reference to Fig. 19 As shown in Fig. 19, the data contained in 
the system stream is recorded in a format packed or packetized according to the MPEG2 standard. The packet structure 
45 is essentially the same for video, audio, and sub-picture data. One pack in the digital video disk system has a 2048 
byte capacity as described above, and contains a pack header PKH and one packet PES; each packet PES contains 
a packet header PTH and data block. 

[0168] The pack header PKH records the time at which that pack is to be sent from stream buffer 2400 to system 
decoder 2500 (see Fig. 26), i.e., the system clock reference SCR defining the reference time for synchronized audio- 

50 visual data playback. The MPEG standard assumes that the system clock reference SCR is the reference clock for 
the entire decoder operation. With such disk media as the digital video disk, however, time management specific to 
individual disk players can be used, and a reference clock for the decoder system is therefore separately provided. 
[0169] The packet header PTH similarly contains a presentation time stamp PTS and a decoding time stamp DTS, 
both of which are placed in the packet before the access unit (the decoding unit). The presentation time stamp PTS 

55 defines the time at which the video data or audio data contained in the packet should be output as the playback output 
after being decoded, and the decoding time stamp DTS defines the time at which the video stream should be decoded. 
Note that the presentation time stamp PTS effectively defines the display start timing of the access unit, and the de- 
coding time stamp DTS effectively defines the decoding start timing of the access unit. If the PTS and DTS are the 
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same time, the DTS is omitted. 

[0170] The packet header PTH also contains an 8-bit field called the stream ID identifying the packet type, i.e., 
whether the packet is a video packet containing a video data stream, a private packet, or an MPEG audio packet. 
[0171] Private packets under the MPEG2 standard are data packets of which the content can be freely defined. 
s Private packet 1 in this embodiment of the invention is used to carry audio data other than the MPEG audio data, and 
sub-picture data; private packet 2 carries the PCI packet and DSI packet. 

[0172] Private packets 1 and 2 each comprise a packet header, private data area, and data area. The private data 
area contains an 8-bit sub-stream ID indicating whether the recorded data is audio data or sub-picture data. The audio 
data defined by private packet 2 may be defined as any of eight types #0 - #7 of linear PCM or AC-3 encoded data. 
10 Sub-picture data may be defined as one of up to 32 types #0 - #31 . 

[0173] The data area is the field to which data compressed according to the MPEG2 specification is written if the 
stored data is video data; linear PCM, AC-3, or MPEG encoded data is written if audio data is stored; or graphics data 
compressed by runlength coding is written if sub-picture data is stored. 

[0174] MPEG2 -compressed video data may be compressed by constant bit rate (CBR) or variable bit rate (VBR) 
*5 coding. With constant bit rate coding, the video stream is input continuously to the video buffer at a constant rate. This 
contrasts with variable bit rate coding in which the video stream is input intermittently to the video buffer, thereby making 
it possible to suppress the generation of unnecessary code. Both constant bit rate and variable bit rate coding can be 
used in the digital video disk system. 

[0175] Because MPEG video data is compressed with variable length coding, the data quantity in each 
20 group_of jDictures GOP is not constant. The video and audio decoding times also differ, and the time-base relationship 
between the video and audio data read from an optical disk, and the time-base relationship between the video and 
audio data output from the decoder, do not match. The method of time-base synchronizing the video and audio data 
is therefore described in detail below with reference to Fig. 26, but is described briefly below based on constant bit 
rate coding. 

25 [0176] The navigation pack NV structure is shown in Fig, 20. Each navigation pack NV starts with a pack header 
PKH, and contains a PCI packet and DSI packet. 

[01 77] As described above, the pack header PKH records the time at which that pack is to be sent from stream buffer 
2400 to system decoder 2500 (see Fig. 26 ), i.e., the system clock reference SCR defining the reference time for 
synchronized audio-visual data playback. 
30 [0178] Each PCI packet contains PCI General Information (PCI_GI) and Angle Information for Non-seamless play- 
back (NMSL_AGLI). 

The PCI General Information (PCI_GI) declares the display time of the first video frame (the Start PTM of VOBU 
(VOBU_S_PTM)), and the display time of the last video frame (End PTM of VOBU (VOBU_E_PTM)), in the corre- 
sponding video object unit VOBU with system clock precision (90 KHz). 

35 [0179] The Angle Information for Non-seamless playback (NMSL_AGLI) states the read start address of the corre- 
sponding video object unit VOBU when the angle is changed expressed as the number of sectors from the beginning 
of the video object VOB. Because there are nine or fewer angles in this example, there are nine angle address decla- 
ration cells: Destination Address of Angle Cell #1 for Non-seamless playback ( N M S L_ AG L_C 1 _ D STA) to Destination 
Address of Angle Cell #9 for Non-seamless playback (NMSL_AGL_C9_DSTA). 

40 [0180] Each DSI packet contains DSI General Information (DSI_GI), Seamless Playback Information (SML_PBI), 
and Angle Information for Seamless playback (SML_AGLI). 

[0181] The DSI General Information (DSI_GI) declares the address of the last pack in the video object unit VOBU, 
i. e. , the End Address for VOB (VOBU_EA), expressed as the number of sectors from the beginning of the video object 
unit VOBU. 

45 [0182] While seamless playback is described in detail later, it should be noted that the continuously read data units 
must be interleaved (multiplexed) at the system stream level as an interleaved unit ILVU in order to seamlessly repro- 
duce split or combined titles. Plural system streams interleaved with the interleaved unit ILVU as the smallest unit are 
defined as an interleaved block. 

[0183] The Seamless Playback Information (SML_PBI) is declared to seamlessly reproduce the stream interleaved 
50 with the interleaved unit ILVU as the smallest data unit, and contains an Interleaved Unit Flag (ILVU flag) identifying 
whether the corresponding video object unit VOBU is an interleaved block. The ILVU flag indicates whether the video 
object unit VOBU is in an interleaved block, and is set to 1 when it is. Otherwise the ILVU flag is set to 0. 
[01 84] When a video object unit VOBU is in an interleaved block, a Unit END flag is declared to indicate whether the 
video object unit VOBU is the last VOBU in the interleaved unit ILVU. Because the interleaved unit ILVU is the data 
55 unit for continuous reading, the Unit END flag is set to 1 if the VOBU currently being read is the last VOBU in the 
interleaved unit ILVU. Otherwise the Unit END flag is set to 0. 

[0185] An Interleaved Unit End Address (ILVU_EA) identifying the address of the last pack in the ILVU to which the 
VOBU belongs, and the starting address of the next interleaved unit ILVU, Next Interleaved Unit Start Address 
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(NTJLVU^SA), are also declared when a video object unit VOBU is in an interleaved block. Both the Interleaved Unit 
End Address (ILVU_EA) and Next Interleaved Unit Start Address (NTJLVU_SA) are expressed as the number of 
sectors from the navigation pack NV of that VOBU. 

[0186] When two system streams are seamlessly connected but the audio components of the two system streams 
are not contiguous, particularly immediately before and after the seam, it is necessary to pause the audio output to 
synchronize the audio and video components of the system stream following the seam. Note that non-contiguous audio 
may result from different audio signals being recording with the corresponding video blocks. With an NTSC signal, for 
example, the video frame cycle is approximately 33. 33 msec while the AC-3 audio frame cycle is 32 msec. 
[0187] To enable this resynchronization, audio reproduction stopping times 1 and 2, i.e., Audio Stop PTM 1 in VOB 
(VOB_A STP_PTM1), and Audio Stop PTM2 in VOB (VOB A_STP^PTM2),. indicating the time at which the audio is 
to be paused; and audio reproduction stopping periods 1 and 2, i.e., Audio Gap Length 1 in VOB (VOB_A_GAPJ_EN1) 
and Audio Gap Length 2 in VOB (VOB_A_GAP_LEN2), indicating for how long the audio is to be paused, are also 
declared in the DSI packet. Note that these times are specified at the system clock precision (90 kHz). 
[0188] The Angle Information for Seamless playback (SML_AGLI) declares the read start address when the angle 
is changed. Note that this field is valid when seamless, multi-angle control is enabled. This address is also expressed 
as the number of sectors from the navigation pack NV of that VOBU. Because there are nine or fewer angles, there 
are nine angle address declaration cells: Destination Address of Angle Cell #1 for Seamless playback 
(SML_AGL_C1_DSTA) to Destination Address of Angle Cell #9 for Seamless playback (SML_AGL_C9_DSTA). 
[0189] Note also that each title is edited in video object (VOB) units. Interleaved video objects (interleaved title editing 
units) are referenced as "VOBS"; and the encoded range of the source data is the encoding unit. 

DVD encoder 

[0190] A preferred embodiment of a digital video disk system authoring encoder ECD in which the multimedia bit- 
25 stream authoring system according to the present invention is applied to a digital video disk system is described below 
and shown in Fig. 25. It will be obvious that the authoring encoder ECD applied to the digital video disk system, referred 
to below as a DVD encoder, is substantially identical to the authoring encoder EC shown in Fig. 2. The basic difference 
between these encoders is the replacement in the DVD encoder ECD of the video zone formatter 1 300 of the authoring 
encoder EC above with a VOB buffer 1000 and formatter 1100. It will also be obvious that the bitstream encoded by 
30 this DVD encoder ECD is recorded to a digital video disk medium M. The operation of this DVD encoder ECD is 
therefore described below in comparison with the authoring encoder EC described above. 

[0191] As in the above authoring encoder EC, the encoding system controller 200 generates control signals St9, 
St11, St13, St21, St23, St25, St33, and St39 based on the scenario data St7 describing the user-defined editing in- 
structions input from the scenario editor 1 00, and controls the video encoder 300, sub-picture encoder 500, and audio 
35 encoder 700 in the DVD encoder ECD. Note that the user-defined editing instructions in the DVD encoder ECD are a 
superset of the editing instructions of the authoring encoder EC described above. 

[0192] Specifically, the user-defined editing instructions (scenario data St7) in the DVD encoder ECD similarly de- 
scribe what source data is selected from all or a subset of the source data containing plural titles within a defined time 
period, and how the selected source data is reassembled to reproduce the scenario (sequence) intended by the user. 

40 The scenario data St7 of the DVD encoder ECD, however, further contains such information as: the number of streams 
contained in the editing units, which are obtained by splitting a multi-title source stream into blocks at a constant time 
interval; the number of audio and sub-picture data cells contained in each stream, and the sub-picture display time 
and period; whether the title is a multi-rated title enabling parental lock control; whether the user content is selected 
from plural streams including, for example, multiple viewing angles; and the method of connecting scenes when the 

45 angle is switched among the multiple viewing angles. 

[0193] The scenario data St7 of the DVD encoder ECD also contains control information on a video object VOB unit 
basis. This information is required to encode the media source stream, and specifically includes such information as 
whether there are multiple angles or parental control features. When multiple angle viewing is enabled, the scenario 
data St7 also contains the encoding bit rate of each stream considering data interleaving and the disk capacity, the 

50 start and end times of each control, and whether a seamless connection should be made between the preceding and 
following streams. 

[0194] The encoding system controller 200 extracts this information from the scenario data St7, and generates the 
encoding information table and encoding parameters required for encoding control. The encoding information table 
and encoding parameters are described with reference to Figs. 27, 28, and 29 below. 
55 [01 95] The stream encoding data St33 contains the system stream encoding parameters and system encoding start 
and end timing values required by the DVD system to generate the VOBs. These system stream encoding parameters 
include the conditions for connecting one video object VOB with those before and after, the number of audio streams, 
the audio encoding information and audio Ids, the number of sub-pictures and the sub-picture Ids, the video playback 
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starting time information VPTS, and the audio playback starting time information APTS. 

[0196] The title sequence control signal St39 supplies the multimedia bitstream MBS formatting start and end timing 
information and formatting parameters declaring the reproduction control information and interleave information. 
[0197] Based on the video encoding parameter and encoding start/end timing signal St9, the video encoder 300 
5 encodes a specific part of the video stream St1 to generate an elementary stream conforming to the MPEG2 Video 
standard defined in ISO-13818. This elementary stream is output to the video stream buffer 400 as encoded video 
stream St1 5. 

[0198] Note that while the video encoder 300 generates an elementary stream conforming to the MPEG2 Video 
standard defined in ISO-13818, specific encoding parameters are input via the video encoding parameter signal St9, 
10 including the encoding start and end timing, bit rate, theencoding conditions for the encoding start and end, the material 
type, includingi whether the material is an NTSC or PAL video signal or telecine converted material, and whether the 
encoding mode is set for either open GOP or closed GOP encoding. 

[0199] The MPEG2 coding method is basically an interframe coding method using the correlation between frames 
for maximum signal compression, i.e., the frame being coded (the target frame) is coded by referencing frames before 

*5 and/or after the target frame. However, intra-coded frames, i. e. , frames that are coded based solely on the content 
of the target frame, are also inserted to avoid error propagation and enable accessibility from mid-stream (random 
access). The coding unit containing at least one intra-coded frame ("intra-frame") is called a group_of __pictures GOP. 
[0200] A group_of_pictures GOP in which coding is closed completely within that GOP is known as a "closed GOP. 
" A group_of_pictures GOP containing a frame coded with reference to a frame in a preceding or following (ISO-1 381 8 

20 DOES NOT LIMIT P- and B-picture CODING to referencing PAST frames) group_of_pictures GOP is an "open GOP. 
" It is therefore possible to playback a closed GOP using only that GOP. Reproducing an open GOP, however, also 
requires the presence of the referenced GOP, generally the GOP preceding the open GOP. 

[0201] The GOP is often used as the access unit. For example, the GOP may be used as the playback start point 
for reproducing a title from the middle, as a transition point in a movie, or for fast-forward play and other special repro- 
25 duction modes. High speed reproduction can be achieved in such cases by reproducing only the intra-frame coded 
frames in a GOP or by reproducing only frames in GOP units. 

[0202] Based on the sub-picture stream encoding parameter signal St11 , the sub-picture encoder 500 encodes a 
specific part of the sub-picture stream St3 to generate a variable length coded bitstream of bitmapped data. This 
variable length coded bitstream data is output as the encoded sub-picture stream St1 7 to the sub-picture stream buffer 
30 600. 

[0203] Based on the audio encoding parameter signal St13, the audio encoder 700 encodes a specific part of the 
audio stream St5 to generate the encoded audio data. This encoded audio data may be data based on the MPEG1 
audio standard defined in ISO-1 11 72 and the MPEG2 audio standard defined in ISO-13818, AC-3 audio data, or PCM 
(LPCM) data. Note that the methods and means of encoding audio data according to these standards are known and 
35 commonly available. 

[0204] The video stream buffer 400 is connected to the video encoder 300 and to the encoding system controller 
200. The video stream buffer 400 stores the encoded video stream St1 5 input from the video encoder 300, and outputs 
the stored encoded video stream St15 as the time-delayed encoded video stream St27 based on the timing signal 
St21 supplied from the encoding system controller 200. 

40 [0205] The sub-picture stream buffer 600 is similarly connected to the sub-picture encoder 500 and to the encoding 
system controller 200. The sub-picture stream buffer 600 stores the encoded sub-picture stream St1 7 input from the 
sub-picture encoder 500, and then outputs the stored encoded sub-picture stream St1 7 as time-delayed encoded sub- 
picture stream St29 based on the timing signal St23 supplied from the encoding system controller 200. 
[0206] The audio stream buffer 800 is similarly connected to the audio encoder 700 and to the encoding system 

45 controller 200. The audio stream buffer 800 stores the encoded audio stream St1 9 input from the audio encoder 700, 
and then outputs the encoded audio stream St1 9 as the time-delayed encoded audio stream St31 based on the timing 
signal St25 supplied from the encoding system controller 200. 

[0207] The system encoder 900 is connected to the video stream buffer 400, sub-picture stream buffer 600, audio 
stream buffer 800, and the encoding system controller 200, and is respectively supplied thereby with the time-delayed 
50 encoded video stream St27, time-delayed encoded sub-picture stream St29, time-delayed encoded audio stream St31 , 
and the system stream encoding parameter data St33. Note that the system encoder 900 is a multiplexer that multi- 
plexes the time-delayed streams St27, St29, and St31 based on the stream encoding data St33 (timing signal) to 
generate title editing units (VOBs) St35. 

[0208] The VOB buffer 1 000 temporarily stores the video objects VOBs produced by the system encoder 900. The 
55 formatter 1100 reads the delayed video objects VOB from the VOB buffer 1000 based on the title sequence control 
signal St39 to generate one video zone VZ, and adds the volume file structure VFS to generate the edited multimedia 
stream data St43. 

[0209] The multimedia bitstream MBS St43 edited according to the user-defined scenario is then sent to the recorder 
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1200. The recorder 1200 processes the edited multimedia stream data St43 to the data stream St45 format of the 
recording medium M, and thus records the formatted data stream St45 to the recording medium M. 

DVD decoder 

5 

[0210] A preferred embodiment of a digital video disk system authoring decoder DCD in which the multimedia bit- 
stream authoring system of the present invention is applied to a digital video disk system is described below and shown 
in Fig. 26. The authoring decoder DCD applied to the digital video disk system, referred to below as a DVD decoder 
DCD, decodes the multimedia bitstream MBS edited using the DVD encoder ECD of the present invention, and rec- 

10 reates the content of each title according to the user-defined scenario. It will also be obvious that the multimedia 
bitstream St45 encoded by this DVD encoder ECD is recorded to a digital video disk medium M. 
[0211] The basic configuration of the DVD decoder DCD according to this embodiment is the same as that of the 
authoring decoder DC shown in Fig. 3. The differences are that a different video decoder 3801 (shown as 3800 in Fig. 
26) is used in place of the video decoder 3800, and a reordering buffer 3300 and selector 3400 are disposed between 

15 the video decoder 3801 and synthesizer 3500. 

[0212] Note that the selector 3400 is connected to the synchronizer 2900, and is controlled by a switching signal 
St103. 

[021 3] The operation of this DVD decoder DCD is therefore described below in comparison with the authoring decoder 
DC described above. 

20 [0214] As shown in Fig. 26, the DVD decoder DCD comprises a multimedia bitstream producer 2000, scenario se- 
lector 21 00, decoding system controller 2300, stream buffer 2400, system decoder 2500, video buffer 2600, sub-picture 
buffer 2700, audio buffer 2800, synchronizer 2900, video decoder 3801 , reordering buffer 3300, sub-picture decoder 
3100, audio decoder 3200, selector 3400, synthesizer 3500, video data output terminal 3600, and audio data output 
terminal 3700. 

25 [0215] The bitstream producer 2000 comprises a recording media drive unit 2004 for driving the recording medium 
M ; a reading head 2006 for reading the information recorded to the recording medium M and producing the binary read 
signal St57; a signal processor 2008 for variously processing the read signal St57to generate the reproduced bitstream 
St61 ; and a reproduction controller 2002. 

[0216] The reproduction controller 2002 is connected to the decoding system controller 2300 from which the multi- 

30 media bitstream reproduction control signal St53 is supplied, and in turn generates the reproduction control signals 
St55 and St59 respectively controlling the recording media drive unit (motor) 2004 and signal processor 2008. 
[0217] So that the user-defined video, sub-picture, and audio portions of the multimedia title edited by the authoring 
encoder EC are reproduced, the authoring decoder DC comprises a scenario selector 2100 for selecting and repro- 
ducing the corresponding scenes (titles). The scenario selector 21 00 then outputs the selected titles as scenario data 

35 to the DVD decoder DCD. 

[0218] The scenario selector 2100 preferably comprises a keyboard, CPU, and monitor. Using the keyboard, the 
user then inputs the desired scenario based on the content of the scenario input by the DVD encoder ECD. Based on 
the keyboard input, the CPU generates the scenario selection data St51 specifying the selected scenario. The scenario 
selector 21 00 is connected to the decoding system controller 2300 by an infrared communications device, for example, 

40 and inputs the generated scenario selection data St51 to the decoding system controller 2300. 

[021 9] The stream buffer 2400 has a specific buffer capacity used to temporarily store the reproduced bitstream St61 
input from the bitstream producer 2000, extract the volume file structure VFS, the initial synchronization data SCR 
(system clock reference) in each pack, and the VOBU control information (DSI) in the navigation pack NV, to generate 
the bitstream control data St63. The stream buffer 2400 is also connected to. the decoding system controller 2300, to 

45 which it supplies the generated bitstream control data St63. 

[0220] Based on the scenario selection data St51 supplied by the scenario selector 2100, the decoding system 
controller 2300 then generates the bitstream reproduction control signal St53 controlling the operation of the bitstream 
producer 2000. The decoding system controller 2300 also extracts the user-defined playback instruction data from the 
bitstream reproduction control signal St53, and generates the decoding information table required for decoding control. 

so This decoding information table is described further below with reference to Figs. 62 and 63. The decoding system 
controller 2300 also extracts the title information recorded to the optical disk M from the file data structure area FDS 
of the bitstream control data St63 to generate the title information signal St200. Note that the extracted title information 
includes the video manager VMG, VTS information VTSI, the PGC information entries C_PBI #j, and the cell presen- 
tation time C_PBTM. 

55 [0221] Note that the bitstream control data St63 is generated in pack units as shown in Fig. 1 9, and is supplied from 
the stream buffer 2400 to the decoding system controller 2300, to which the stream buffer 2400 is connected. 
[0222] The synchronizer 2900 is connected to the decoding system controller 2300 from which it receives the system 
clock reference SCR contained in the synchronization control data St81 to set the internal system clock STC and supply 
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the reset system clock St79 to the decoding system controller 2300. 

[0223] Based on this system clock St79, the decoding system controller 2300 also generates the stream read signal 
St65 at a specific interval and outputs the read signal St65 to the stream buffer 2400. Note that the read unit in this 
case is the pack. 

5 [0224] The method of generating the stream read signal St65 is described next. 

[0225] The decoding system controller 2300 compares the system clock reference SCR contained in the stream 
control data extracted from the stream buffer 2400 with the system clock St79 supplied from the synchronizer 2900, 
and generates the read request signal St65 when the system clock St79 is greater than the system clock reference 
SCR of the bitstream control data St63. Pack transfers are controlled by executing this control process on a pack unit. 

10, [0226] Based on the scenario selection data St51, the decoding system controller 2300 generates the decoding 
signal St69 defining the stream Ids for the video, sub-picture, and audio bitstreams corresponding to the selected 
scenario, and outputs to the system decoder 2500. 

[0227] When a title contains plural audio tracks, e.g. audio tracks in Japanese, English, French, and/or other lan- 
guages, and plural sub-picture tracks for subtitles in Japanese, English, French, and/or other languages, for example, 

15 a discrete ID is assigned to each of the language tracks. As described above with reference to Fig. 1 9, a stream ID is 
assigned to the video data and MPEG audio data, and a substream ID is assigned to the sub-picture data, AC-3 audio 
data, linear PCM data, and navigation pack NV information. While the user need never be aware of these ID numbers, 
the user can select the language of the audio and/or subtitles using the scenario selector 21 00. If English language 
audio is selected, for example, the ID corresponding to the English audio track is sent to the decoding system controller 

20 2300 as scenario selection data St51 . The decoding system controller 2300 then adds this ID to the decoding signal 
St69 output to the system decoder 2500. 

[0228] Based on the instructions contained in the decoding signal St69, the system decoder 2500 respectively outputs 
the video, sub-picture, and audio bitstreams input from the stream buffer 2400 to the video buffer 2600, sub-picture 
buffer 2700, and audio buffer 2800 as the encoded video stream St71 , encoded sub-picture stream St73, and encoded 
25 audio stream St75. Thus, when the stream ID input from the scenario selector 2100 and the pack ID input from the 
stream buffer 2400 match, the system decoder 2500 outputs the corresponding packs to the respective buffers (i.e., 
the video buffer 2600, sub-picture buffer 2700, and audio buffer 2800). 

[0229] The system decoder 2500 detects the presentation time stamp PTS and decoding time stamp DTS of the 
smallest control unit in each bitstream St67 to generate the time information signal St77. This time information signal 
30 st77 is supplied to the synchronizer 2900 through the decoding system controller 2300 as the synchronization control 
data St81. 

[0230] Based on this synchronization control data St81 , the synchronizer 2900 determines the decoding start timing 
whereby each of the bitstreams will be arranged in the correct sequence after decoding, and then generates and inputs 
the video stream decoding start signal St89 to the video decoder 3801 based on this decoding timing. The synchronizer 
35 2900 also generates and supplies the sub-picture decoding start signal St91 and audio stream decoding start signal 
St93 to the sub-picture decoder 3100 and audio decoder 3200, respectively. 

[0231] The video decoder 3801 generates the video output request signal St84 based on the video stream decoding 
start signal St89, and outputs to the video buffer 2600. In response to the video output request signal St84, the video 
buffer 2600 outputs the video stream St83 to the video decoder 3801 . The video decoder 3801 thus detects the pres- 
40 entation time information contained in the video stream St83, and disables the video output request signal St84 when 
the length of the received video stream St83 is equivalent to the specified presentation time. A video stream equal in 
length to the specified presentation time is thus decoded by the video decoder 3801 , which outputs the reproduced 
video signal St95 to the reordering buffer 3300 and selector 3400. 

[0232] Because the encoded video stream is coded using the interframe correlations between pictures, the coded 
45 order and display order do not necessarily match on a frame unit basis. The video cannot, therefore, be displayed in 
the decoded order. The decoded frames are therefore temporarily stored to the reordering buffer 3300. The synchro- 
nizer 2900 therefore controls the switching signal St1 03 so that the reproduced video signal St95 output from the video 
decoder 3800 and the reordering buffer output St97 are appropriately selected and output in the display order to the 
synthesizer 3500. 

50 [0233] The sub-picture decoder 3100 similarly generates the sub-picture output request signal St86 based on the 
sub-picture decoding start signal St91 , and outputs to the sub-picture buffer 2700. In response to the sub-picture output 
request signal St86, the sub-picture buffer 2700 outputs the sub-picture stream St85 to the sub-picture decoder 31 00. 
Based on the presentation time information contained in the sub-picture stream St85, the sub-picture decoder 3100 
decodes a length of the sub-picture stream St85 corresponding to the specified presentation time to reproduce and 

55 supply to the synthesizer 3500 the sub-picture signal St99. 

[0234] The synthesizer 3500 superimposes the selector 3400 output with the sub-picture signal St99 to generate 
and output the video signal St105 to the video data output terminal 3600. 

[0235] The audio decoder 3200 generates and supplies to the audio buffer 2800 the audio output request signal 
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St88 based on the audio stream decoding start signal St93. The audio buffer 2800 thus outputs the audio stream St87 
to the audio decoder 3200. The audio decoder 3200 decodes a length of the audio stream St87 corresponding to the 
specified presentation time based on the presentation time information contained in the audio stream St87, and outputs 
the decoded audio stream St101 to the audio data output terminal 3700. 
5 [0236] It is thus possible to reproduce a user-defined multimedia bitstream MBS in real-time according to a user- 
defined scenario. More specifically, each time the user selects a different scenario, the DVD decoder DCD is able to 
reproduce the title content desired by the user in the desired sequence by reproducing the multimedia bitstream MBS 
corresponding to the selected scenario. 

[0237] It should be noted that the decoding system controller 2300 may supply the title information signal St200 to 
10 the scenario selector 2100 by means of the infrared communications device.mentioned above or another means. 
Interactive scenario selection controlled by the user can also be made possible by the scenario selector 21 00 extracting 
the title information recorded to the optical disk M from the file data structure area FDS of the bitstream control data 
St63 contained in the title information signal St200, and displaying this title information on a display for user selection. 
[0238] Note, further, that the stream buffer 2400, video buffer 2600, sub-picture buffer 2700, audio buffer 2800, and 
15 reordering buffer 3300 are expressed above and in the figures as separate entities because they are functionally 
different. It will be obvious, however, that a single buffer memory can be controlled to provide the same discrete func- 
tionality by time-share controlled use of a buffer memory with an operating speed plural times faster than the read and 
write rates of these separate buffers. 

20 Multi-scene control 

[0239] The concept of multiple angle scene control according to the present invention is described below with refer- 
ence to Fig. 21. As described above, titles that can be played back with numerous variations are constructed from 
basic scene periods containing data common to each title, and multi-scene periods comprising groups of different 
25 scenes corresponding to the various scenario requests, in Fig. 21 , scenes 1 , 5, and 8 are the common scenes of the 
basic scene periods. The multi-angle scenes (angles 1 , 2, and 3) between scenes 1 and 5, and the parental locked 
scenes (scenes 6 and 7) between scenes 5 and 8, are the multi-scene periods. 

[0240] Scenes taken from different angles, i.e., angles 1 , 2, and 3 in this example, can be dynamically selected and 
reproduced during playback in the multi-angle scene period. In the parental locked scene period, however, only one 
30 of the available scenes, scenes 6 and 7, having different content can be selected, and must be selected statically 
before playback begins. 

[0241] Which of these scenes from the multi-scene periods is to be selected and reproduced is defined by the user 
operating the scenario selector 21 00 and thereby generating the scenario selection data St51 . In scenario 1 in Fig. 21 
the user can freely select any of the multi-angle scenes, and scene 6 has been preselected for output in the parental 
35 locked scene period. Similarly in scenario 2, the user can freely select any of the multi-angle scenes, and scene 7 has 
been preselected for output in the parental locked scene period. 

[0242] With reference to Figs. 30 and 31 , furthermore, the contents of the program chain information VTS_PGCI is 
described. In Fig. 30, the case that a scenario requested by the user is shown with respect to a VTSI data construction. 
The scenario 1 and scenario 2 shown in Fig. 21 are described as program chain information VTS_PGC#1 and 
40 VTS_PGC#2. VTS_PGC#1 describing the scenario 1 consists of cell playback information C_PBI#1 corresponding to 
scene 1 , C_PBI#2, C_PBI#3, and C_PBI#4 within a multi-angle cell block, C_PBI#5 corresponding to scene 5, C_PBI#6 
corresponding to scene 6, and C_PBI#7 corresponding to scene 8. 

[0243] VTS_PGCI#2 describing the scenario 2 consists of cell playback information C_PBI#1 corresponding to scene 
1, C_PBI#2, C_PBI#3, and C_PBI#4 within a multi-angle cell block corresponding to a multi-angle scene, C_PBI#5 
45 corresponding to scene 5, C_PBI#6 corresponding to scene 7, and C_PBI#7 corresponding to scene 8. According to 
the digital video system data structure, a scene which is a control unit of a scenario is described as a cell which is a 
unit thereunder, thus a scenario requested by a user can be obtained. 

[0244] In Fig. 31 , the case that a scenario requested by the user shown in Fig. 21 is shown with respect to a VOB 
data construction VTSTT_VOBS. As specifically shown in Fig. 31 , the two scenarios 1 and 2 use the same VOB data 
50 in common. With respect to a single scene commonly owned by each scenario, VOB#1 corresponding to scene 1, 
VOB#5 corresponding to scene 5, and VOB#8 corresponding to scene 8 are arranged in non-interleaved block which 
is the contiguous block. 

[0245] With respect to the multi-angle data commonly owned by scenarios 1 and 2, one angle scene data is con- 
structed by a single VOB. Specifically speaking, angle 1 is constructed by VOB#2, and angle 2 is constructed by 
55 VOB#3, angle 3 is constructed by VOB#4. Thus constructed multi-angle data is formed as the interleaved block for 
the sake of switching between each angle and seamless reproduction of each angle data. Scenes 6 and 7 peculiar to 
scenarios 1 and 2, respectively, are formed as the interleaved block for the sake of seamless reproduction between 
common scenes before and behind thereof as well as seamless reproduction between each scene. 
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[0246] As described in the above, the user's requesting scenario shown in Fig. 21 can be realized by utilizing the 
video title playback control information shown in Fig. 30 and the title playback VOB data structure shown in Fig. 31 . 

Seamless playback 

5 

[0247] The seamless playback capability briefly mentioned above with regard to the digital video disk system data 
structure is described below. Note that seamless playback refers to the reproduction in a digital video disk system of 
multimedia data including video, audio, and sub-picture data without intermittent breaks in the data or information 
between basic scene periods, between basic scene periods and multi-scene periods, and between multi-scene periods. 

w [0248] Hardware factors contributing to intermittent playback of this data and title content include decoder underflow, 
i.e., an imbalance between the source data input speed and the decoding speed of the input source data. 
[0249] Other factors relate to the properties of the playback data. When the playback data is data that must be 
continuously reproduced for a constant time unit in order for the user to understand the content or information, e.g., 
audio data, data continuity is lost when the required continuous presentation time cannot be assured. Reproduction 

15 of such information whereby the required continuity is assured is referred to as "contiguous information reproduction, 
" or "seamless information reproduction." Reproduction of this information when the required continuity cannot be 
assured is referred to as "non-continuous information reproduction," or "non-seamless information reproduction." It is 
obvious that continuous information reproduction and non-continuous information reproduction are, respectively, seam- 
less and non-seamless reproduction. 

20 [0250] Note that seamless reproduction can be further categorized as seamless data reproduction and seamless 
information reproduction. Seamless data reproduction is defined as preventing physical blanks or interruptions in the 
data playback (intermittent reproduction) as a result of a buffer underflow state, for example. Seamless information 
reproduction is defined as preventing apparent interruptions in the information when perceived by the user (intermittent 
presentation) when recognizing information from the playback data where there are no actual physical breaks in the 

25 data reproduction. 

Details of Seamless playback 

[0251 ] The specific method enabling seamless reproduction as thus described is described later below with reference 
30 to Figs. 23 and 24. 

Interleaving 

[0252] The DVD data system streams described above are recorded using an appropriate authoring encoder EC as 
35 a movie or other multimedia title on a DVD recording medium. Note that the following description refers to a movie as 
the multimedia title being processed, but it will be obvious that the invention shall not be so limited. 
[0253] Supplying a single movie in a format enabling the movie to be used in plural different cultural regions or 
countries requires the script to be recorded in the various languages used in those regions or countries. It may even 
necessitate editing the content to conform to the mores and moral expectations of different cultures. Even using such 
40 a large-capacity storage system as the DVD system, however, it is necessary to reduce the bit rate, and therefore the 
image quality, if plural full-length titles edited from a single common source title are recorded to a single disk. This 
problem can be solved by recording the common parts of plural titles only once, and recording the segments different 
in each title for each different title only. This method makes it possible to record plural titles for different countries or 
cultures to a single optical disk without reducing the bit rate, and, therefore, retaining high image quality. 
45 [0254] As shown in Fig. 21 , the titles recorded to a single optical disk contain basic scene periods of scenes common 
to all scenarios, and multi-scene periods containing scenes specific to certain scenarios, to provide parental lock control 
and multi-angle scene control functions. 

[0255] In the case of the parental lock control function, titles containing sex scenes, violent scenes, or other scenes 
deemed unsuitable for children, i.e., so-called "adult scenes," are recorded with a combination of common scenes, 
50 adult scenes, and children's scenes. These title streams are achieved by arraying the adult and children's scenes to 
multi-scene periods between the common basic scene periods. 

[0256] Multi-angle control can be achieved in a conventional single-angle title by recording plural multimedia scenes 
obtained by recording the subjects from the desired plural camera angles to the multi-scene periods arrayed between 
the common basic scene periods. Note, however, that while these plural scenes are described here as scenes recorded 
55 from different camera angles (positions), it will be obvious that the scenes may be recorded from the same camera 
angle but at different times, data generated by computer graphics, or other video data. 

[0257] When data is shared between different scenarios of a single title, it is obviously necessary to move the laser 
beam LS from the common scene data to the non-common scene data during reproduction, i.e., to move the optical 
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pickup to a different position on the DVD recording medium RC1 . The problem here is that the time required to move 
the optical pickup makes it difficult to continue reproduction without creating breaks in the audio or video, i.e., to sustain 
seamless reproduction. This problem can be theoretically solved by providing a track buffer (stream buffer 2400) to 
delay data output an amount equivalent to the worst access time. In general, data recorded to an optical disk is read 
5 by the optical pickup, appropriately processed, and temporarily stored to the track buffer. The stored data is subse- 
quently decoded and reproduced as video or audio data. 

Specific Problem in Interleaving 

w [0258] The operation of.the stream buffer 2400, or a track buffer, in this DVD system is described briefly below. . . „ 
[0259] Because rotational speed control of the optical disk drive cannot instantaneously adjust the disk speed, the 
input to the stream buffer 2400, i.e., the transfer rate Vr from the optical disk, is a basically constant rate. Output from 
the track buffer, i.e., the transfer rate Vo to the decoder, however, varies according to the image quality, user intent, 
and even the variable bit rate of the compressed video data in the DVD system. The from-disk transfer rate Vr is 

15 therefore constant at approximately 1 1 Mbps in the DVD system, while the transfer rate Vo from the buffer is variable 
to a maximum 10 Mbps. 

[0260] Because of this gap between the input and output transfer rates Vr and Vo, the stream buffer 2400 will overflow 
if data is transferred continuously from disk. Therefore, to prevent the stream buffer 2400 of the reproduction apparatus 
overflowing, data is transferred intermittently from the disk. The stream buffer is controlled during normal continuous 

20 reproduction to always be in a near-overflow state. 

[0261 ] It is therefore possible by using a stream buffer 2400 to sustain continuous, uninterrupted data output (repro- 
duction) even if there is a slight break in data reading as a result of the read head (optical pickup) 2006 jumping to 
move between logic sectors LS on the disk. In an actual reproduction apparatus, however, the jump time may vary 
between 200 msec and 2 sec depending upon the distance and position on the disk M. While it is possible to use a 

25 stream buffer 2400 with capacity sufficient to absorb this jump time, the compression bit rate is an average 4 - 5 Mbps 
and a maximum 10 Mbps when using a large-capacity optical disk M storing high quality images. Assuring seamless 
reproduction when jumping from any disk position to any other with this system imposes significant memory require- 
ments, resulting in an expensive decoder DC. Because the amount of memory that can be built in to the decoder DC 
is therefore limited if a cost-effective product is to be marketed, other limitations are imposed, e. g., limiting the jump 

30 time to a time whereby data can be continuously reproduced. 

[0262] The relationship between the accumulated data quantity in the stream buffer 2400 and the operating mode 
of the reading head 2006 is shown in Fig. 32. In Fig. 32, period Tr is the period during which the optical pickup reads 
data from the optical disk RC, and period Tj is the jump period during which the optical pickup moves between logic 
sectors. Line L1 expresses the change in data volume Vd stored in the stream buffer 2400 during the data read period 

35 Tr. Line L2 expresses the change in data volume Vd stored in the stream buffer 2400 during the jump period Tj. 

[0263] During the data read period Tr, the reading head 2006 reads data from the optical disk M at the transfer rate 
Vr, and stores the read data directly to the stream buffer 2400. The stream buffer 2400 supplies the data to the decoders 
3801 , 31 00, and 3200 at the transfer rate Vo. As a result, data accumulates in the stream buffer 2400 during the data 
read period Tr at the rate Vr-Vo, i.e., the difference between these two transfer rates. 

40 [0264] Because the reading head 2006 is moving to access a different disk position during the jump period Tj, data 
read from the optical disk M cannot be stored to the stream buffer 2400. Data continues to be output from the stream 
buffer 2400 to the decoders 3801 , 31 00, and 3200 during this period, however, and the accumulated data volume Vd 
remaining in the stream buffer 2400 decreases at the rate at which data is transferred to the decoders, i.e., transfer 
rate Vo. 

45 [0265] Note that while the transfer rate Vo to the decoders is shown in Fig. 32 as a constant rate, data transfers to 
the decoders actually occur intermittently because the decoding time varies according to the data type. These differ- 
ences are omitted to simply this description of the buffer underflow concept. This is equivalent to the reading head 
2006 reading data intermittently during the jump period Tj, even through the reading head 2006 continuously reads 
from the optical disk M at a constant linear velocity (CLV). The slopes L1 and L2 of lines L1 and L2 above can be 

so expressed by the following equations. 

L1 = Vr-Vo [1] 



L2 = Vo [2] 
[0266] Underflow occurs, and decoding stops, when the jump period Tj is long enough for stream buffer 2400 to 
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empty. If the jump period Tj is held to less than the time required for the stream buffer 2400 to empty, however, decoding 
can continue without data interruptions. This time at which the reading head 2006 can jump to another disk position 
without a data underflow state occurring in the stream buffer 2400 is referred to as the jump-enabled time. 
[0267] Note that while physical movement of the reading head 2006 has been described as a factor causing a data 
5 underflow state in the stream buffer 2400, the following factors also contribute. Note, further, that this data underflow 
state can be simply characterized as a condition in which data input to the track buffer (stream buffer 2400) does not 
keep pace with data output from the track buffer. 

a) The buffer is too small relative to the decoding speed of the decoder. 
10 . . b) The sizes of the individual input units of the plural VOB types carried in the reproduced bitstream St61 input to 
the stream buffer 2400 from the bitstream producer 2000 are inappropriate for the buffer size, 
c) The sequence of the individual input units of the plural VOB types carried in the reproduced bitstream St61 is 
inappropriate for the decoding speed, resulting in the next data unit to be decoded not being received in sufficient 
time while decoding a particular data unit. 

15 

[0268] When the reproducing apparatus of the digital video disk system is configured with a disk read rate of 11 
Mbps, a maximum AV data compression rate of 10 Mbps, and a track buffer (stream buffer 2400) capacity of 4 Mbits, 
for example, a data underflow state will occur. To prevent this data underflow state from occurring during the jump 
period Tj in a reproduction apparatus thus configured with slight overflow control maintained during normal continuous 
20 playback, a maximum jump-enabled time of 400 msec can be assured even in a worst-case scenario where 1 0 Mbps 
of AV data is reproduced during the jump period. 

[0269] Note that a jump-enabled time of 400 msec is a realistic value that can be achieved in actual reproduction 
apparatuses. In such an apparatus the distance that can be jumped during a 400 msec period is approximately 500 
tracks. It follows that the jump-enabled time can also be defined in data quantity terms, specifically the volume of the 
25 sequential data stream that the reading head 2006 can move through within the jump-enabled time. The data volume 
equivalent to a 400 msec jump-enabled time is approximately 250 Mbits. It will also be obvious that the actual distance 
in sector and track units on the recording medium can also be easily obtained from the data volume defined as the 
jumpable distance based on the recording method and density of the recording medium. 

[0270] A 250-Mbit jumpable distance is equivalent to a 50 sec. presentation time assuming AV data reproduced at 
30 an average 5 Mbits/sec. With even higher quality AV data, the presentation time is less than 50 sec. In movies and 
other data streams edited to remove particular scenes due to cultural, educational, or other reasons, the excised scenes 
are typically long, usually ranging from 2 to 5 minutes, and even as long as 10 minutes. If the excised scene lasts 5 
minutes, for example, with the above reproduction apparatus, it is not possible to uninterruptedly connect the scenes 
before and after an excised scene if the before scene, excised scene, and after scene are recorded to disk in a single 
35 contiguous series. More specifically, the data corresponding to a continuous 5 minute scene cannot be jumped in a 
single jump operation. 

[0271] There are also cases in which the AV data compression rate, i.e., the consumption rate Vo of data from the 
track buffer, approaches 1 0 Mbps and non-occurrence of a buffer underflow state cannot be assured, even when the 
excised scene data is jumped using a jump period Tj exceeding 400 msec. While such a data underflow state can 

40 conceivably be avoided by recording two AV data scenarios, i.e., excised (edited) and un-excised (unedited) scenarios, 
to disk, this method does not make effective use of the available disk capacity. If it is also necessary to record duplicate 
data for much presentation time, the AV data will be low quality and it will be difficult to satisfy most user expectations. 
[0272] The concept of sharing data between plural titles is described below with reference to Fig. 33 wherein TL1 
and TL2 refer to the data content of a first and second title, respectively. Specifically, the first title TL1 consists of data 

45 DbA, DbB, and DbD reproduced contiguously during time T, and the second title TL2 similarly consists of data DbA, 
DbC, and DbD reproduced contiguously during time T Note that data DbA, DbB, DbC, and DbD are video objects VOB 
with display times T1 , T2, T2, and T3, respectively. When two titles TL1 and TL2 thus comprised are recorded to disk, 
the data stream is structured with data DbA and DbD as shared data of both titles TL1 and TL2, and data DbB and 
DbC as the discrete data of titles TL1 and TL2, respectively, selectively reproduced during selection period T2 as shown 

so by title TL1_2. Note that while there appears to be a time gap between the data blocks in Fig. 33, this is simply to 
facilitate the use of arrows in the figure for easier description of the data reproduction paths. There is no actual time 
gap during playback. 

[0273] The format whereby these data blocks DbA, DbB, DbC, and DbD are recorded to disk in title TL1_2 to enable 
contiguous reproduction of the data is shown in Fig. 34. In principle, data blocks DbA, DbB, DbC, and DbD composing 
55 a single contiguous title are recorded to contiguous segments of the data recording track TR (Fig. 9). Thus, data DbA, 
DbB, and DbD of first title TL1 are first recorded contiguously, and are followed by data DbC unique to the second title 
TL2. With this data format, the first title TL1 can be contiguously reproduced without breaks in the title content, i.e., 
seamlessly, by means of controlling the reading head 2006 to access data DbA, DbB, and DbD synchronized to play- 
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back periods T1 , T2, and T3, respectively. 

[0274] To reproduce the second title TL2, however, the reading head 2006 must jump past data DbB and DbD after 
reproducing data DbA in period T1 , and must reach data DbC before the start of period T2 as shown by arrow Sq2a 
in Fig. 34. Then after reproducing data DbC, the reading head 2006 must jump back to data units DbC and DbD as 
5 shown by arrow Sq2b, and reach data DbD before the start of period T3. Because of the time required to move the 
reading head 2006 between data blocks, it is not possible to assure seamless reproduction between data DbA and 
DbC, and between DbC and DbD, unless the distance traveled between these data blocks requires less time than the 
time required for a data underflow state to occur in the stream buffer 2400. 

10 Definition of Interleaving 

[0275] To thus enable the user to selectively excise scenes and choose from among plural scenes, a state wherein 
non-selected scene data is recorded inserted between common scene data and selective scene data necessarily 
occurs because the data units associated with individual scenes are contiguously recorded to the recording tracks of 
15 the recording medium. If data is then read in the recorded sequence, non-selected scene data must be accessed 
before accessing and decoding the selected scene data, and seamless connections with the selected scene is difficult. 
The excellent random access characteristics of the digital video disk system, however, make seamless connections 
with the selected scenes possible. 

[0276] In other words, by splitting scene-specific data into plural units of a specified data size, and interleaving plural 
20 split data units for different scenes in a predefined sequence that is recorded to disk within the jumping range whereby 
an data underflow state does not occur, it is possible to reproduce the selected scenes without data interruptions by 
intermittently accessing and decoding the data specific to the selected scenes using these split data units. Seamless 
data reproduction is thereby assured. 

25 Detailed Definition of Interleaving 

[0277] The seamless connection method and data splitting and arraying concepts of the present invention are de- 
scribed below using the track buffer input transfer rate Vr and data consumption rate Vo described above. 
[0278] Referring again to Fig. 32, the data consumption rate Vo is less than the transfer rate Vr (Vr > Vo). Using this 

30 difference, a known data volume is read at the transfer rate Vr, buffered to the track buffer and stored, and is then 
consumed (output to the decoder) while the optical pickup moves to the position of the next read data. To prevent a 
data underflow state even after multiple repetitions of this operation, split data units for a defined volume of data for 
each scene are distributed at an appropriate interval. This method of arraying data to assure seamless data reproduc- 
tion is called "interleaving," the split data units of a size sufficient for buffering to the track buffer are called "interleave 

35 units," and the interleave units after interleaving are called "interleaved units." 

[0279] When one scene is selected from plural scenes, interleave units are required for the plural video objects VOBs 
composing those plural scenes. Two interleaved units ILVU contiguous on the time-base and belonging to the selected 
scene are separated by one or more interleaved units ILVU belonging to another scene inserted therebetween. The 
distance between two time-base-contiguous interleaved units ILVU belonging to the same scene is defined as the 

^0 jnterleave distance. 

[0280] For example, 260 msec is required to move 10,000 sectors using an optical disk recording medium. If this 
1 0,000-sector movement of the optical pickup is the interleave unit distance, the size of the interleaved units ILVU can 
be determined from the difference between the track buffer input and output transfer rates Vr and Vo, and the size of 
the track buffer. Let us assume, for example, that compressed data is being output at a constant rate using an input 

45 transfer rate Vr of 11 Mbps and an output transfer rate Vo of 8 Mbps, and a track buffer capacity of 3 Mbits. If the 
distance between interleaved units ILVU is 1 0,000 sectors, there must be enough interleaved units ILVU to input to the 
track buffer to accumulate therein a 260-msec equivalent of playback data before the reading head moves. 
[0281] In general, 2080 Kbits is equivalent to 260 msec of playback data. To store this much data to the track buffer 
before moving between interleaved units ILVU, source data must be input to the track buffer for 0.7 sec. at the difference 

so between transfer rates Vr and Vo (= 2080 Kbit/(11-8) Mbits/sec). The time needed to read enough source data from 
the recording medium M to store sufficient data to the track buffer to provide for data consumption by the decoder while 
the optical pickup is moving to the target interleaved unit ILVU and can resume data reading is defined as the minimum 
accumulation read time. 

[0282] The minimum interleaved unit size, i.e., the amount of data that must be read, is therefore 7.7 Mbits. Converted 
55 to presentation time, this makes it possible to interleave units with a presentation time of 0. 96 sec. or greater with 
presentation time intervals less than 20 sec. between interleaved units. By reducing the system stream consumption 
bit rate, the minimum accumulation read time can be reduced, and the size of the interleaved units can be reduced. It 
is also possible to increase the jumpable time without changing the interleaved unit size. 
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[0283] One method of connecting scenes is shown in Fig. 35 where the following scenarios are possible: scene A 
connected to scene D; scene A connected to scene B, which is substituted for a certain time segment of scene D; and 
scene A connected to scene C, which is similarly substituted for a certain time segment of scene D but for a different 
time segment than scene B. In such cases, scene D is further blocked into scene segments (scenes) D-1, D-2, and 
5 D-3. The system stream corresponding to scenes B, D-1 , C, and D-2 is the output system stream output at transfer 
rate Vo (= 8 Mbps). Data is input to the track buffer at transfer rate Vr (= 11 Mbps) in the scene sequence B, D-1 , C, 
D-2; the data size of each scene is greater than the above value (= 0,96 sec,), and can be arrayed within the distance 
that can be jumped between connected scenes as described above. 

[0284] However, when scenes with the same starting points but different end points, e.g., scenes B, C, and D in this 
10 example, are interleaved, three different system streams become interleaved during the period corresponding to scene 
D-1 , two system streams become interleaved during the period corresponding to scene D-2, and processing tends to 
become complex. It is therefore more common when interleaving plural video objects VOB to interleave objects having 
the same starting and end points, thus simplifying processing. This can be accomplished as shown in Fig. 36 by copying 
and appending scene D-2 to scene C of Fig. 35, thereby unifying the points at which plural scenes diverge and connect, 
*5 j.e., the starting and end points, with plural interleaved video objects VOB. When interleaving branching and connecting 
scenes in the digital video disk system, therefore, the scenes are interleaved with the starting and end points thereof 
aligned. 

[0285] The interleaving concept is described in further detail below. 

[0286] The audio-video (AV) system stream described above is one example of interleaving with time information. 

20 This interleaving method arrays audio and video information having the same time-base so that data input to the buffer 
at approximately the same time is in physical proximity, and data units having approximately the same presentation 
time are arrayed in alternating sequence. When one scene can be replaced with another scene in titles such as movies, 
however, the time between these plural scenes often differs. When the AV system stream interleaving method described 
above is applied in such cases, the between-scene time difference can be absorbed by the buffer if it is within the 

25 jumpable time. If the between-scene time difference is greater than the jumpable time, however, the buffer cannot 
absorb the difference and seamless playback is not possible. 

[0287] If the track buffer size is increased to increase the amount of data that can be accumulated at one time, a 
longer jumpable time is obtained, and the interleave unit and sequence are more easily determined. When interactive 
operation enabling the user to select from among plural scenes (e.g., multi-angle scenes) with seamless playback, the 
30 presentation time of the previous angle stream is increased after the angle (stream) is changed if the interleaved units 
ILVU are made longer and the amount of data stored at one time is increased. This also results in more time being 
required for the displayed stream to change. 

[0288] Interleaving is thus the process of optimizing the system stream arrangement in data block units to prevent 
a data underflow state from occurring in the track buffer of the authoring decoder DC when the encoded data supplied 

35 from the stream source is consumed for decoding by the decoder. 

[0289] A major cause of this data underflow state is the mechanical movement of the optical pickup, though there 
are lesser factors such as the decoding speed of the communications (decoding) system. Mechanical movement of 
the optical pickup is a problem when the optical pickup scans the data recording track TR of the optical disk M to read 
data. Interleaving is also needed when recording the data to the data recording track TR of the optical disk M. The 

40 decoding speed of the communications system can also become a problem when the source stream is supplied directly 
as occurs with live broadcasts, cable television and other dedicated line transmissions, broadcast satellites and other 
radio wave transmissions, and other means whereby title content is not reproduced from a recording medium on the 
user's side. In such cases the transmitted source stream must be interleaved. 

[0290] Interleaving can thus be more specifically defined as the process whereby the data carried in a source stream 
45 comprising source data groups of plural contiguously input source data blocks is arranged in a particular sequence 
whereby the desired (targeted) source data can be intermittently and sequentially accessed to contiguously reproduce 
the information in the targeted source data. The duration of these intermittent breaks in the input of the target source 
data that is to be reproduced is therefore defined as the jump time of the interleave control. 

[0291] It should be noted, however, that an interleaving method for arraying on a random access disk movies and 
50 other common titles containing scene selections that may branch and recombine according to various scenarios in a 
manner whereby video objects VOB containing video data compressed with variable length coding has not been de- 
scribed. When arraying this type of data to disk, a certain amount of trial and error is required using the compressed 
data. This obviously makes it necessary to define an interleaving method for arranging plural video objects in a se- 
quence enabling seamless playback. 
55 [0292] When applied to a DVD system as described above, these video objects are distributed to positions (navigation 
packs NV) within a specific time-base range bounded by GOP units, the unit of video data compression. The GOP 
data length, however, may vary as a result of user-defined image manipulation processes or intra-f rame coding inserted 
for higher image quality. This means that the location of the management packs (navigation packs NV), which are 
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dependent upon the presentation time, can vary. This means that the point at which the scene angle changes or the 
optical pickup jumps to the next data block in the playback sequence is indefinite. Furthermore, if plural scene angles 
are interleaved together, the length of contiguously read data is indefinite even if the next jump point is known. Thus, 
the data end point position is known only after the next angle data is read, and switching the playback data is thus 
5 delayed. 

[0293] The present invention therefore provides a method and apparatus enabling seamless data reproduction using 
an optical disk having a data structure whereby data is shared between plural titles to efficiently utilize the available 
optical disk space, and a new function called "multi-angle scene reproduction" is achieved. The method and apparatus 
of the present invention are described below with reference to the accompanying figures. 

10 

Interleaved block and Interleave unit 

[0294] The interleaving method enabling seamless data reproduction according to the present invention is described 
below with reference to Fig. 24 and Fig. 37. Shown in Fig. 24 is a case from which three scenarios may be derived, i. 
15 e . , branching from one video object VOB-A to one of plural video objects VOB-B, VOB-C, and VOB-D, and then merging 
back again to a single video object VOB-E. The actual arrangement of these blocks recorded to a data recording track 
TR on disk is shown in Fig. 37. 

[0295] Referring to Fig, 37, VOB-A and VOB-E are video objects with independent playback start and end times, 
and are in principle arrayed to contiguous block regions. As shown in Fig. 24, the playback start and end times of VOB- 
20 B, VOB-C, and VOB-D are aligned during interleaving. The interleaved data blocks are then recorded to disk to a 
contiguous interleaved block region. The contiguous block regions and interleaved block regions are then written to 
disk in the track path Dr direction in the playback sequence. Plural video objects VOB, i.e., interleaved video objects 
VOBS, arrayed to the data recording track TR are shown in Fig. 37. 

[0296] Referring to Fig. 37, data regions to which data is continuously arrayed are called "blocks," of which there 
25 are two types: "contiguous block regions" in which VOB with discrete starting and end points are contiguously arrayed, 

and "interleaved block regions" in which plural VOB with aligned starting and end points are interleaved. The respective 

blocks are arrayed as shown in Fig. 38 in the playback sequence, i.e., block 1 , block 2, block 3, ... block 7. 

[0297] As shown in Fig. 38, the VTS title VOBS (VTSTT_VOBS) consist of blocks 1 - 7, inclusive. Block 1 contains 

VOB 1 alone. Blocks 2, 3, 5, and 7 similarly discretely contain VOBS 2, 3, 6, and 10. Blocks 2, 3, 5, and 7 are thus 
30 contiguous block regions. 

[0298] Block 4, however, contains VOB 4 and VOB 5 interleaved together, while block 6 contains VOB 7, VOB 8, 

and VOB 9 interleaved together. Blocks 4 and 6 are thus interleaved block regions. 

[0299] The internal data structure of the contiguous block regions is shown in Fig. 39 with VOB-i and VOB-j arrayed 
as the contiguous blocks in the VOBs. As described with reference to Fig. 16, VOB-i and VOB-j inside the contiguous 
35 block regions are further logically divided into cells as the playback unit. Both VOB-i and VOB-j in this figure are shown 
comprising three cells CELL #1 , CELL #2, and CELL #3. 

[0300] Each cell comprises one or more video object unit VOBU with the video object unit VOBU defining the bound- 
aries of the cell. Each cell also contains information identifying the position of the cell in the program chain PGC (the 
playback control information of the digital video disk system). More specifically, this position information is the address 
40 of the first and last VOBU in the cell. As also shown in Fig. 39, these VOB and the cells defined therein are also recorded 
to a contiguous block region so that contiguous blocks are contiguously reproduced. Reproducing these contiguous 
blocks is therefore no problem. 

[0301] The internal data structure of the interleaved block regions is shown in Fig. 40. In the interleaved block regions 
each video object VOB is divided into interleaved units ILVU, and the interleaved units ILVU associated with each VOB 

45 are alternately arrayed. Cell boundaries are defined independently of the interleaved units ILVU. For example, VOB- 
k is divided into four interleaved units ILVUkl , ILVUk2, ILVUk3, and ILVUk4, and are confined by a single cell CELL#k. 
VOB-k is likewise divided into four interleaved units ILVUml, ILVUm2, ILVUm3, and ILVUm4, and is confined by a 
single cell CELL#m. Note that instead of a single cell CELL#k or CELL#m, each of VOB-k and VOB-m can be divided 
into more than two cells. The interleaved units ILVU thus contains both audio and video data. 

so [0302] In the example shown in Fig. 40, the interleaved units ILVUkl , ILVUk2, ILVUk3, and ILVUk4, and ILVUml , 
ILVUm2, ILVUm3, and ILVUm4, from two different video objects VOB-k and VOB-m are alternately arrayed within a 
single interleaved block. By interleaving the interleaved units ILVU of two video objects VOB in this sequence, it is 
possible to achieve seamless reproduction branching from one scene to one of plural scenes, and from one of plural 
scenes to one scene. This interleaving process further enables seamless reproduction of scenes from various scenario 

55 threads in most cases. 
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An alternative for achieving Interleaving 

[0303] Seamless reproduction is even possible when there are three possible scenario threads as shown in Fig. 35, 
i.e., from scene A to scene B, and then to scene D-3 starting at some midpoint in scene D; from scene A to the beginning 
5 of scene D; and from scene A to scene C, and then to scene D-2 starting at a midpoint in scene D different from D-3. 
As shown in Fig. 36, it is also possible by connecting before and after scenes (scene D-2) to align the starting and end 
points according to the data structure of the present invention. Note that this process of copying scenes to align the 
starting and end points makes it possible to seamlessly reproduce even complicated threading of different scenes. 

10 . , An interleaving algorithm compatible with variable length coded data 

[0304] An example of an interleaving algorithm compatible with variable length coded data, i.e., video data, is de- 
scribed next. 

[0305] When plural VOB are interleaved, each VOB is basically divided into the same particular number of interleave 

*5 units. It is also possible to obtain the data volume (size) of each of this particular number of interleave units based on 
the bit rate of the VOB to be interleaved, the jump time and distance that can be jumped within this jump time, the track 
buffer capacity and the input rate Vr to the track buffer, and the position of the video object unit VOBU. Each interleaved 
unit consists of video object units VOBU, each of which comprises one or more MPEG GOP with a data quantity 
equivalent to a 0.4-1 .0 second presentation time. 

20 [0306] When data is interleaved, the interleaved units ILVU forming separate video objects VOB are alternately 
arrayed. If any of the plural interleave units interleaved to the shortest of plural VOB is shorter than the minimum 
interleave unit length, or if the total length of plural interleave units in any but the shortest of plural VOB, is greater than 
the shortest interleave distance, an underflow state will occur when this shortest interleave VOB is reproduced, and 
non-seamless reproduction will result instead of seamless reproduction. 

25 [0307] Before encoding occurs in the present embodiment it is therefore determined whether interleaving is possible. 
Whether interleaving is possible can be determined from the lengths of the pre-encoded streams. Because the effect 
of interleaving can thus be determined in advance, the need for data reprocessing, i.e., the need to adjust the inter- 
leaving conditions and record after interleaving and encoding are once completed, can be prevented. 
[0308] The interleaving method for recording to an optical disk according to the present invention is described spe- 

30 cifically below, starting with the bit rate of the recorded VOB and the performance and other conditions of the reproduced 
disk. 

[0309] When interleaving is applied, the relationship between the track buffer input transfer rate Vr and output transfer 
rate Vo is defined as Vr > Vo as previously described. The maximum bit rate of each VOB to be interleaved is therefore 
set to less than the track buffer input transfer rate Vr. The maximum bit rate B of each VOB is therefore set to a value 
35 less than Vr. If in the evaluation determining whether interleaving enabling seamless reproduction is possible it is 
assumed that all plural VOB to be interleaved are encoded with constant bit rate (CBR) coding at the maximum bit 
rate, the interleave unit data volume is maximized, the time that can be reproduced from the data volume that can be 
placed at the jumpable distance is shortened, and the interleave conditions are stringent. 

[0310] Note that it is assumed below that each VOB is encoded with constant bit rate (CBR) coding at the maximum 
40 bit rate. 

[0311] it is assumed in the following description of the reproduction apparatus that the distance the optical pickup 
can jump within disk jump time JT expressed as a data quantity is jumpable distance JM, and the input data bit rate 
to the track buffer of the reproduction apparatus is BIT 

[0312] In terms of an actual reproduction apparatus, the disk jump time JT is 400 msec, the jumpable distance JM 
45 is 250 Mbit in this jump time JT. The maximum bit rate B of the VOB in the MPEG system is defined as 8.8 Mbps 

considering that an average 6 Mbps is required to obtain image quality superior to that of a conventional VCR recording. 

[0313] Based on the jumpable distance JM, jump time JT, and data read times from disk, the target values for the 

smallest interleaved unit size ILVUM and the smallest interleave unit presentation time ILVUMT are calculated first. 

The smallest interleave unit presentation time ILVUMT can be obtained from equation 3, and the smallest interleaved 
so unit size ILVUM can be obtained from equation 4 below. 



ILVUMT > JT + ILVUM/BIT [3] 
ILVUMT x B = ILVUM [4] 



[0314] From equation 3 the smallest interleave unit presentation time ILVUMT is determined to be 2 sec., and the 
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smallest GOP block data GM is 17.6 Mbit. Thus, if the smallest interleave unit, which is the smallest unit of the data 
layout, is a 2-sec. equivalent data quantity and each GOP comprises 15 frames (NTSC format), then the smallest 
interleave unit is equivalent to the data stored in four GOP. 

[0315] As also stated above, one interleaving condition is that the interleave distance be less than or equal to the 
5 jumpable distance. This means that except for the VOB with the shortest presentation time, the total presentation time 
of plural VOB to which interleaving is applied must be shorter than the time that can be reproduced within the interleave 
distance. If as in the above example the jumpable distance JM is 250 Mbit and the maximum VOB bit rate is 8.8 Mbps, 
the time JMT that can be reproduced with the data quantity read within the jumpable distance JM is 28.4 sec. It is then 
possible to calculate an interleavable conditions equation using these values. If each VOB in the interleaved block 
10 region is divided into the same number of interleaved, block and this number of interleaved divisions is v, equation 5 
can be obtained from the conditions of the smallest interleave unit length. 

(presentation time of shortest VOB)/ILVUMT < v [5] 

15 

Equation 6 can then also be obtained from the jumpable presentation time conditions. 

v < (presentation time of all VOB except shortest VOB)/JMT [6] 

20 

[0316] Plural VOB can in principle be interleaved if the above conditions are fulfilled. In practical terms, VOBU com- 
pensation to the value calculated according to the above equations is necessary because the interleave units are 
limited to the VOBU boundaries. To compensate equations 2, 3, and 4, it is necessary to add the maximum VOBU time 
(1 . 0 sec.) to the smallest interleave unit presentation time ILVUMT, and reduce the presentation time in the interleave 

25 distance JMT by the maximum VOBU time. 

[0317] If it is determined that an interleaved sequence enabling seamless playback is not possible as a result of 
calculating the conditions for interleaving the VOB scenes before encoding, it is necessary to increase the number of 
interleave unit divisions. This means increasing the length of the scene forming the shortest VOB by moving the scene 
therebefore or after. It is also necessary to add the same scene added to the shortest scene to the other scenes. In 

30 general, the interleave distance is significantly greater than the shortest interleave unit, and the rate of increase in the 
value on the left side of equation 4 is greater than the increase in the value on the right side of equation 6. As a result, 
increasing the size of the shifted (moved) scene can result in satisfying the above conditions. 
[0318] The track buffer input transfer rate Vr and output transfer rate Vo must be in the relationship Vr > Vo as 
described above for the data in the interleaved blocks. The optical pickup may also jump immediately after entering 

35 an interleaved block region from a contiguous block region, in which case it is also necessary to store the data from 
immediately before the interleaved block region. This makes it necessary to suppress the bit rate of part of the data in 
the VOB immediately before the interleaved block region. 

[0319] It is also possible that the optical pickup may jump immediately after entering an interleaved block from a 
contiguous block, in which case it is necessary to suppress the maximum bit rate of the contiguous block immediately 
40 before the interleaved block, and store the data from immediately before the interleaved block to the track buffer. The 
target for suppressing the maximum bit rate, i.e., the amount the maximum bit rate is suppressed, is the presentation 
time of the shortest interleave unit, which can be calculated from the maximum bit rate of the interleaved block repro- 
duced after the contiguous block. 

[0320] Note that while the above operation has assumed that ail VOB are divided into the same number of interleave 
45 units, it is also possible to group the VOB into a group of VOB containing u divisions (interleave units) and a group of 
VOB containing u+1 divisions when there is a significant difference in VOB length. 

[0321] This can be accomplished by defining the minimum number of divisions in each VOB obtained from equation 
5 as u and using this value u for all VOB from which a greater number of interleave units cannot be obtained, and using 
the number of interleave units obtained from equation 4 up to (u + 1) for larger VOB. This is illustrated in Fig. 41 . 

50 [0322] The data structure of the interleaved unit ILVU in another embodiment of the invention is shown in Fig. 42. In 
this structure the navigation pack NV described with reference to Fig. 20 is placed at the beginning of each video object 
unit VOBU, and the data from one navigation pack NV to immediately before the next navigation pack NV is defined 
as one video object unit VOBU. The video object unit VOBU boundaries determine the boundaries of the interleaved 
unit ILVU, each of which is longer than the length of the shortest interleaved unit, which is obtained from the decoder 

55 performance, bit rate, and other considerations using equations 5 and 6. 

[0323] Each of the component video object units VOBU contains a navigation pack NV (management information 
pack), each navigation pack NV containing an ILVU end pack address ILVU_EA indicating the address of the last pack 
in the ILVU to which the VOBU belongs, and the next-ILVU start address NTJLVU_SA. The position to which the start 
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address NT_ILVU_SA points is the address of the first pack (NV) in the next interleaved unit ILVU to be reproduced, 
while the ILVU end pack address ILVILEA points to the end position of the interleaved unit ILVU containing the end 
pack address ILVU_EA. Note that as previously described these addresses are expressed as the number of sectors 
from the navigation pack NV containing the address information. 
5 [0324] When a video object unit VOBU is in an interleaved block region, the nexHLVU start address NT_ILVU_SA 
points to the start address of the next ILVU, similarly expressed as the number of sectors from the navigation pack NV 
containing the address information. 

[0325] It is therefore possible after reading the first navigation pack NV data in the interleaved unit ILVU to obtain 
the location of the next interleaved unit ILVU to be reproduced, and to what disk position the current interleaved unit 
10 ILVU is written. This makes it possible to.only read the interleaved units ILVU,. and to smoothly jump to the next inter- 
leaved unit ILVU. 

Multi-scene control 

15 [0326] The multi-scene period is described together with the concept of multi-scene control according to the present 
invention using by way of example a title comprising scenes recorded from different angles. 

[0327] Each scene in multi-scene control is recorded from the same angle, but may be recorded at different times 
or may even be computer graphics data. The multi-angle scene periods may therefore also be called multi-scene 
periods. 

20 

Parental control 

[0328] The concept of recording plural titles comprising alternative scenes for such functions as parental lock control 
and recording director's cuts is described below using Fig. 43. 

25 [0329] An example of a multi-rated title stream providing for parental lock control is shown in Fig. 43. When so-called 
"adult scenes" containing sex, violence, or other scenes deemed unsuitable for children are contained in a title imple- 
menting parental lock control, the title stream is recorded with a combination of common system streams SSa, SSb, 
and Sse, an adult-oriented system stream SSc containing the adult scenes, and a child-oriented system stream SSd 
containing only the scenes suitable for children. Title streams such as this are recorded as a multi-scene system stream 

30 containing the adult-oriented system stream Ssc and the child-oriented system stream Ssd arrayed to the multi-scene 
period between common system streams Ssb and Sse. 

[0330] The relationship between each of the component titles and the system stream recorded to the program chain 
PGC of a title stream thus comprised is described below. 

[0331] The adult-oriented title program chain PGC1 comprises in sequence the common system streams Ssa and 
35 Ssb, the adult-oriented system stream Ssc, and the common system stream Sse. The child-oriented title program chain 
PGC2 comprises in sequence the common system streams Ssa and Ssb, the child-oriented system stream Ssd, and 
the common system stream Sse. 

[0332] By thus arraying the adult-oriented system stream Ssc and child-oriented system stream Ssd to a multi-scene 
period, the decoding method previously described can reproduce the title containing adult-oriented content.by repro- 
40 ducing the common system streams Ssa and Ssb, then selecting and reproducing the adult-oriented system stream 
Ssc, and then reproducing the common system stream Sse as instructed by the adult-oriented title program chain 
PGC1 . By alternatively following the child-oriented title program chain PGC2 and selecting the child-oriented system 
stream Ssd in the multi-scene period, a child-oriented title from which the adult-oriented scenes have been expurgated 
can be reproduced. 

45 [0333] This method of providing in the title stream a multi-scene period containing plural alternative scenes, selecting 
which of the scenes in the multi-scene period are to be reproduced before playback begins, and generating plural titles 
containing essentially the same title content but different scenes in part, is called parental lock control. 
[0334] Note that parental lock control is so named because of the perceived need to protect children from undesirable 
content. From the perspective of system stream processing, however, parental lock control is a technology for statically 

so generating different title streams by means of the user pre-selecting specific scenes from a multi-scene period. Note, 
further, that this contrasts with multi-angle scene control, which is a technology for dynamically changing the content 
of a single title by. means of the user selecting scenes from the multi-scene period freely and in real-time during title 
playback. 

[0335] This parental lock control technology can also be used to enable title stream editing such as when making 
55 the director's cut. The director's cut refers to the process of editing certain scenes from a movie to, for example, shorten 
the total presentation time. This may be necessary, for example, to edit a feature-length movie for viewing on an airplane 
where the presentation time is too long for viewing within the flight time or certain content may not be acceptable. The 
movie director thus determines which scenes may be cut to shorten the movie. The title can then be recorded with 
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both a full-length, unedited system stream and an edited system stream in which the edited scenes are recorded to 
multi-scene periods. At the transition from one system stream to another system stream in such applications, parental 
lock control must be able to maintain smooth playback image output. More specifically, seamless data reproduction 
whereby a data underflow state does not occur in the audio, video, or other buffers, and seamless information repro- 
5 duction whereby no unnatural interruptions are audibly or visibly perceived in the audio and video playback, are nec- 
essary. 

Multi-angle control 

w [0336] The concept of multi-angle scene control in the present invention is described next with reference to Fig. 44. .... 
In general, multimedia titles are obtained by recording both the audio and video information (collectively "recording" 
below) of the subject over time T. The angled scene blocks #SC1, #SM1, #SM2, #SM3, and #SC3 represent the 
multimedia scenes obtained at recording unit times T1 , T2, and T3 by recording the subject at respective camera 
angles. Scenes #SM1, #SM2, and #SM3 are recorded at mutually different (first, second, and third) camera angles 

15 during recording unit time T2, and are referenced below as the first, second, and third angled scenes. 

[0337] Note that the multi-scene periods referenced herein are basically assumed to comprise scenes recorded from 
different angles. The scenes may, however, be recorded from the same angle but at different times, or they may be 
computer graphics data. The multi-angle scene periods are thus the multi-scene periods from which plural scenes can 
be selected for presentation in the same time period, whether or not the scenes are actually recorded at different 

20 camera angles. 

[0338] Scenes #SC1 and #SC3 are scenes recorded at the same common camera angle during recording unit times 
T1 and T3, i.e., before and after the multi-angle scenes. These scenes are therefore called "common angle scenes." 
Note that one of the multiple camera angles used in the multi-angle scenes is usually the same as the common camera 
angle. 

25 [0339] To understand the relationship between these various angled scenes, multi-angle scene control is described 
below using a live broadcast of a baseball game for example only. 

[0340] The common angle scenes #SC1 and #SC3 are recorded at the common camera angle, which is here defined 
as the view from center field on the axis through the pitcher, batter, and catcher. 

[0341] The first angled scene #SM1 is recorded at the first multi-camera angle, i.e., the camera angle from the 
30 backstop on the axis through the catcher, pitcher, and batter. The second angled scene #SM2 is recorded at the second 
multi-camera angle, i.e., the view from center field on the axis through the pitcher, batter, and catcher. Note that the 
second angled scene #SM2 is thus the same as the common camera angle in this example. It therefore follows that 
the second angled scene #SM2 is the same as the common angle scene #SC2 recorded during recording unit time 
T2. The third angled scene #SM3 is recorded at the third multi-camera angle, i.e., the camera angle from the backstop 
35 focusing on the infield. 

[0342] The presentation times of the multiple angle scenes #SM1 , #SM2, and #SM3 overlap in recording unit time 
T2; this period is called the "multi-angle scene period." By freely selecting one of the multiple angle scenes #SM1, 
#SM2, and #SM3 in this multi-angle scene period, the viewer is able to change his or her virtual viewing position to 
enjoy a different view of the game as though the actual camera angle is changed. Note that while there appears to be 

40 a time gap between common angle scenes #SC1 and #SC3 and the multiple angle scenes #SM1 , #SM2, and #SM3 
in Fig. 44, this is simply to facilitate the use of arrows in the figure for easier description of the data reproduction paths 
reproduced by selecting different angled scenes. There is no actual time gap during playback. 
[0343] Multi-angle scene control of the system stream based on the present invention is described next with reference 
to Fig. 23 from the perspective of connecting data blocks. The multimedia data corresponding to common angle scene 

45 #sc is referenced as common angle data BA, and the common angle data BA in recording unit times T1 and T3 are 
referenced as BA1 and BA3, respectively. The multimedia data corresponding to the multiple angle scenes #SM1, 
#SM2, and #SM3 are referenced as first, second, and third angle scene data MA1 , MA2, and MA3. As previously 
described with reference to Fig. 44, scenes from the desired angled can be viewed by selecting one of the multiple 
angle data units MA1 , MA2, and MA3. There is also no time gap between the common angle data BA1 and BA3 and 

50 the multiple angle data units MA1 , MA2, and MA3. 

[0344] In the case of an MPEG system stream, however, intermittent breaks in the playback information can result 
between the reproduced common and multiple angle data units depending upon the content of the data at the connec- 
tion between the selected multiple angle data unit MA1 , MA2, and MA3 and the common angle data BA (either the 
first common angle data BA1 before the angle selected in the multi-angle scene period or the common angle data BA3 

55 following the angle selected in the multi-angle scene period). The result in this case is that the title stream is not naturally 
reproduced as a single contiguous title, i.e., seamless data reproduction is achieved but non-seamless information 
reproduction results. 

[0345] The multi-angle selection process whereby one of plural scenes is selectively reproduced from the multi-angle 
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scene period with seamless information presentation to the scenes before and after is described below with application 
in a digital video disk system using Fig. 23. 

[0346] Changing the scene angle, i.e., selecting one of the multiple angle data units MA1 , MA2, and MA3, must be 
completed before reproduction of the preceding common angle data BA1 is completed. It is extremely difficult, for 

5 example, to change to a different angle data unit M A2 during reproduction of common angle data BA1 . This is because 
the multimedia data has a variable length coded MPEG data structure, which makes it difficult to find the data break 
points (boundaries) in the selected data blocks. The video may also be disrupted when the angle is changed because 
inter-frame correlations are used in the coding process. The group_of_pictures GOP processing unit of the MPEG 
standard contains at least one refresh frame, and closed processing not referencing frames belonging to another GOP 

10 is possible. within, this GOP processing unit. 

[0347] In other words, if the desired angle data, e. g., MA3, is selected before reproduction reaches the multi-angle 
scene period, and at the latest by the time reproduction of the preceding common angle data BA1 is completed, the 
angle data selected from within the multi-angle scene period can be seamlessly reproduced. However, it is extremely 
difficult while reproducing one angle to select and seamlessly reproduce another angle within the same multi-angle 

15 scene period. It is therefore difficult when in a multi-angle scene period to dynamically select a different angle unit 
presenting, for example, a view from a different camera angle. 

[0348] The method of switching between scene angles within a multi-angle scene period is described in further detail 
below with reference to Fig. 76, Fig. 77, and Fig. 45 based on the digital video disk data structure described above. 
[0349] Fig. 76 shows the presentation times of the smallest angle.switching units in each of the multiple angle data 

20 units MA1 , MA2, and MA3 shown in Fig, 23: Within the context of the digital video disk system, the multiple angle data 
units MA1 , MA2, and MA3 are the title editing units, i.e., the video objects VOB. Each multiple angle data unit (VOB) 
comprises plural (three in this example) interleave units ILVU as the smallest possible angle switching unit. Note that 
each interleaved unit ILVU thus contains a particular number of GOP. The first angle data MA1 thus contains angle 
switching units A51 , A52, and A53. 

25 [0350] The angle switching units A51 , A52, and A53 of the first angle data MA1 have presentation times of 1 , 2, and 
3 seconds, respectively, resulting in the first angle data MA1 having a total presentation time of 6 sec. The second 
angle data MA2 similarly comprises angle switching units B51, B52, and B53 with presentation times of 2, 3, and 1 
second, respectively, and the third angle data MA3 comprises angle switching units C51 , C52, and C53 with presen- 
tation times of 3, 1 , and 2 seconds, respectively. It should be noted that while the angle data MA1 , MA2, and MA3 each 

30 has a 6 sec. presentation time, the presentation time can obviously be set to various other particular values. 

[0351] The following example describes switching to and beginning the presentation of another (next) angle view 
while reproducing any particular interleave unit. For example, if the angle is changed from first angle data MA1 to 
second angle data MA2 while reproducing interleave unit A51 , reproduction of interleave unit A51 stops and reproduc- 
tion of the second interleave unit B52 in the selected second angle data MA2 starts. This change intermits both the 

35 audio and video presentation, resulting in non-seamless information presentation. 

[0352] Likewise, if the user chooses to switch to the scene corresponding to the third angle data MA3 during repro- 
duction of the second interleave unit B52 in the second angle data MA2, reproduction of interleave unit B52 stops and 
reproduction of interleave unit C53 starts. This operation again intermits both the audio and video presentation, resulting 
in non-seamless information presentation. 

40 [0353] Note that in the above cases switching between multiple angle data during data reproduction stops reproduc- 
tion of the angle being presented, and seamless information presentation, i.e., presentation without intermitting the 
audio and video, does not occur. 

[0354] The method of switching the angle after completing interleave unit presentation is described below. 
[0355] If the user selects second angle data MA2 during reproduction of interleave unit A51 in first angle data MA1 , 

45 for example, the second interleave unit B52 in the second angle data MA2 is selected from the point reproduction of 
interleave unit A51 with a presentation time of 1 sec. is completed. Because the start of interleave unit B52 is 2 sec. 
from the beginning of the multi-angle scene period, there is a gap between the end of interleave unit A51 reproduction 
at 1 sec. into the multi-angle scene period and the beginning of interleave unit B52 at 2 sec. into the multi-angle scene 
period, and there is therefore no time-base contiguity in reproduction. More specifically, because there is no continuity 

50 in the audio, the audio cannot be seamlessly reproduced. 

[0356] A similar scenario results if the user then switches to the angle scene corresponding to the third angle data 
MA3 during reproduction of the second interleave unit B52 in the selected second angle data MA2, i.e., interleave unit 
C53 is switched to after reproduction of interleave unit B52 is completed. In this case the completion of interleave unit 
B52 reproduction is at 5 sec. from the beginning of the multi-angle scene period, the beginning of interleave unit C53 

55 is at 4 sec. from the beginning of the multi-angle scene period, and there is again no time-base contiguity. The con- 
nection between the audio and video components of the interleave units B52 and C53 is also not good. 
[0357] It is therefore necessary for the interleave units of each angle to have the same reproduction time and contain 
the same number of video frames in order to achieve seamless information switching between multiple angles. 
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[0358] The internal structure of the interleave units is shown in further detail in Fig. 77. As shown in Fig. 77, each of 
the interleave units ILVUbl , ILVUb2, ILVUd , and ILVUc2 contains interleaved audio and video packets, indicated as 
A and V, respectively. 

[0359] In general, the data size and presentation time of each audio packet A is the same. In this example, interleave 
5 units ILVUbl, ILVUb2, ILVUd , and ILVUc2 respectively contain 3, 2, 2, and 3 audio packets. Therefore, angle data 
MAB and MAC each contain 5 audio packets and 13 video packets in multi-angle scene period T2. 
[0360] Angle switching control within a multi-angle scene period comprising a multi-angle system stream (VOB) with 
this packet structure is described below. 

[0361] If the angle is switched from interleaved unit ILVUbl to interleaved unit ILVUc2, for example, the total number 
10 of audio packets in these two interleave units ILVUbl and ILVUc2 is 6, which is one more than the specified total 
number of 5 audio packets in this multi-angle scene period T2. This would therefore result in an overlap of one audio 
packet in the reproduced audio if these two 1LVU were connected and reproduced. Likewise, if the angle is changed 
between the two interleaved units ILVUd and ILVUb2 containing only two audio packets each, the total of 4 audio 
packets is one less than the specified total number of 5 audio packets in this multi-angle scene period T2. Connecting 
15 and reproducing these two interleaved units would therefore result in an audio packet shortage, and a break in the 
audio equal to the duration of one audio packet. It is therefore clear that when the number of audio packets in the 
connected interleaved units is not equal to the specified number of audio packets in that multi-angle scene period, the 
audio will not be satisfactorily connected, and non-seamless information presentation containing noise and/or inter- 
mittent audio will result. 

20 [0362] Fig. 45 shows multi-angle scene control when the multi-angle data MAB and MAC shown in Fig. 77 contain 
different audio data. Common angle data BA1 and BA3 contain the common audio data of the common scenes before 
and after the multi-angle scene period. The first angle data MAB comprises first angle interleave unit audio data ILVUbl 
and ILVUb2, the smallest angle switching unit within the multi-angle scene period. The second angle data MAC likewise 
comprises second angle interleave unit audio data ILVUd and ILVUc2. 

25 [0363] Fig. 14 shows the audio waves resulting from the audio data recorded to the multi-angle data MAB and MAC 
within multi-angle scene period T2. The contiguous audio output of angle data MAB is formed from the two interleaved 
unit audio data ILVUbl and ILVUb2. The contiguous audio output of angle data MAC is likewise formed from the two 
interleaved unit audio data ILVUd and ILVUc2. 

[0364] What happens when reproduction of angle data MAC is switched to during reproduction of the first interleave 
30 unit audio data ILVUbl in angle data MAB is described below. In this case reproduction of interleaved unit ILVUc2 
occurs after reproduction of interleaved unit ILVUbl is completed. As a result, the audio wave resulting from these two 
interleaves units is a mixed wave as shown by audio wave MAB-C in Fig. 12. Note that the resulting audio output wave 
is non-contiguous at the angle switching point, and the audio connection is therefore not completely satisfactory. 
[0365] If the audio data is encoded using Dolby AC-3 coding, the problems are even greater because AC-3 coding 
35 use time-base correlations. Specifically, because the audio is coded using time-base signal correlations, reproduction 
becomes impossible at the angle switching point when the user attempts to stop the audio data from one angle and 
switch to the audio data of another angle during reproduction in a multi-angle scene period. 

[0366] Thus, when each angle in a multi-angle scene period contains discrete audio data, the audio output may be 
intermitted between the connected data units when the angle is changed. Depending on the content of the data in such 
40 cases, noise and/or audio interruptions may occur during reproduction, resulting even in irritation or discomfort to the 
user. Such discomfort is a product of the non-contiguity in the reproduced information content, and can therefore be 
avoided by assuring information contiguity or preventing information intermittence. Seamless information presentation 
can thus be achieved. 

[0367] Multi-angle scene control according to the present invention is described next with reference to Fig. 46. Three 
45 angle data MA1 , MA2, and MA3 are written to multi-angle scene period T2 in this example. 

[0368] The first angle data MA1 thus contains angle switching units A11 , A12, and A13 as the smallest possible 
angle switching units. These angle switching units A51 , A52, and A53 have presentation times of 2, 1 , and 3 seconds, 
respectively. 

[0369] The second angle data MA2 similarly comprises angle switching units B1 1 , B1 2, and B1 3, and the third angle 
50 data MA3 comprises angle switching units C11, C12, and C13. Each of these angle switching units also respectively 
have presentation times of 2, 1 , and 3 seconds. 

[0370] Because the synchronized interleave units thus have the same presentation times, contiguous audio and 
video reproduction can be achieved without interruptions or overlaps at the angle switching position even when the 
user switches from one angle to another angle, and seamless information presentation can therefore be achieved as 
55 previously described. 

[0371] To achieve the data structure shown in Fig. 46, i.e., to set the actual presentation times of the smallest angle 
switching units of the audio and video material in the multi-angle scene period to the same times, the number of re- 
production frames in each interleave unit must be the same. MPEG data compression normally operates on the GOP 
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unit level, and two parameters defining the GOP structure are the M and N values. M is the I- or P-picture cycle, and 
N is the number of frames contained in that GOP. Furthermore, changing the M or N parameter setting frequently during 
the encoding process simply complicates MPEG video encoding control, and is not, therefore, normally done. 
[0372] A method of achieving the data structure shown in Fig. 46, i.e., a method of setting the actual presentation 
5 times of the smallest angle switching units of the audio and video material in the multi-angle scene period to the same 
times, is described below with reference to Fig. 78. 

[0373] As the example shown in Fig. 78 writes two angle data units MAB and MAC to the multi-angle scene period, 
and each angle data unit comprises two interleaved units ILVUbl and ILVUb2 I and ILVUd and ILVUc2, respectively. 
This is for simplicity of discussion only. Each interleaved unit is a GOP structure. The GOP structures in the mufti-angle 

10 scene period set the M and N parameters of the synchronized interleaved units ILVUbl and ILVUd to the same values, 
and similarly set the M and N parameters of the synchronized interleaved units ILVUb2 and ILVUc2 to the same values. 
By thus using the same GOP structure parameters in both angle data MAB and MAC, the presentation times of the 
AV data in different angles can be set to the same on the smallest angle switching unit level. Thus, when the angle is 
changed from first angle data MAB ILVUb2 to angle data MAC ILVUc2, contiguous video presentation without video 

is intemnittence or overlap at the angle switching position can be achieved because the angle switching timing between 
these two ILVU is the same. 

[0374] A method for actually setting the audio data presentation times on the smallest angle switching unit level to 
the same value in different angles is described next with reference to Fig. 79. As in Fig. 77, the interleave units ILVUbl , 
ILVUb2, ILVUd , and ILVUc2 contain interleaved audio and video packets, indicated as A and V, respectively. 

20 [0375] The audio packet A data size and presentation times are normally constant. As also shown in the figure, 
ILVUbl and ILVUd in the multi-angle scene period have the same number of audio packets (2) , and ILVUb2 and ILVUc2 
have the same number of audio packets (3). By thus writing the same number of audio packets to the synchronized 
interleaved units ILVU of different angle data MAB and MAC, the audio data presentation times can be set the same 
on the smallest angle switching unit level in different angles. Thus, when the view is changed between angle data MAB 

25 and MAC, contiguous audio presentation without audio intermittence or overlap at the angle switching position can be 
achieved because the angle switching timing does not change. 

[0376] However, if each of the smallest angle switching units in the multi-angle scene period contain audio data 
describing discrete audio waves, it may not be possible (as described above with reference to Fig. 12) to achieve 
contiguous audio data reproduction at the angle switching point by simply coding each smallest angle switching unit 

30 (ILVU) with the same audio data presentation time. It is possible to avoid intermitting the audio presentation, however, 
by writing the same (common) audio data to each angle within a multi-angle scene period on the smallest angle switch- 
ing unit (ILVU) level. As previously described, seamless information presentation can be achieved by arraying the data 
to assure contiguous information content before and after the connection points in the reproduced data, or by formatting 
the data in closed data units that are completely reproduced at the switching points. 

35 [0377] A further multi-angle scene period data structure in which common audio data is written to each different angle 
is shown in Fig. 80. Unlike the structure shown in Fig. 45, this data structure is achieved by writing audio data that is 
complete within each switching unit (interleaved unit ILVU) to the angle data MAB and MAC. 
[0378] Because the audio data is thus complete within each interleaved unit ILVU, audio data producing a discom- 
forting audio wave as a result of connecting different audio waves at the angle switching point will not result even when 

40 the angle is changed by switching from first angle interleaved unit ILVUbl to second angle interleaved unit ILVUc2 
within an encoded audio data multi-angle scene period thus structured. 

[0379] Note that if the audio data is formed with the same audio wave on the interleaved unit ILVU level, seamless 
information presentation can obviously be achieved just as when the audio wave data is complete within each inter- 
leaved unit ILVU. 

45 [0380] When the audio data is coded using Dolby AC-3 audio coding, the time-base correlations can be retained 
even when the angle is changed, and contiguous audio reproduction can be achieved at the angle switching point 
without noise or intermittence, because common audio data is written to the smallest angle switching units (ILVU) of 
the angle data or is complete within each interleaved unit ILVU. 

[0381] It should be noted that the present invention shall not be limited to only two or three types of angle data MA 
so within the multi-angle scene period. The multi-angle scene period T2 shall also not be limited to VOB units, and may 
extend for the duration of the title stream. 

[0382] It is therefore possible to achieve seamless information presentation as previously defined. 
[0383] The operation of multi-angle control based on the digital video disk data structure is thus as described above. 
[0384] A method of recording to the recording medium multi-angle control data specifically enabling selection of a 
55 different angle data unit while reproducing one angle data unit in the same mufti-angle scene period is, however, 
described below. 

[0385] The common angle data BA1 shown in Fig. 23 is arrayed to a contiguous block region, the interleave unit 
data of the multi-angle data MA1, MA2, and MA3 in the multi-angle scene period is arrayed to an interleaved block 
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region, and the common angle data BA3 following thereafter is arrayed to the next contiguous block region. Referenced 
to the data structure shown in Fig. 16, the common angle data BA1 constitutes one ceil, the multi-angle data MA1 , 
MA2, and MA3 each constitute one cell, the cells corresponding to the multi-angle data MA1 , MA2, and MA3 constitute 
one cell block (the cell block mode CBM of the MA1 cell = "cell block start", the CBM of the MA2 cell = "between first 
5 and last cells", and the CBM of the MA3 cell = "cell block end"), and these cell blocks are angle blocks (cell block type 
CBT = angle). The common angle data BA3 is the cell that connects to that angle block. The cell-cell connections are 
also set for seamless playback (seamless playback flag SPF = "seamless playback"). 

[0386] The structure of the stream containing a multi-angle scene period and the basics of the on-disk layout in the 
preferred embodiment of the invention are shown in Fig. 47. The multi-angle scene period is a period in which the 
io stream can.be freely changed by the user. In a stream with the structure shown in Fig, 47, it is possible to switch to 
VOB-C or VOB-D during reproduction of VOB-B. It is therefore likewise possible to switch to VOB-B or VOB-D during 
reproduction of VOB-C, and to switch to VOB-C or VOB-B during reproduction of VOB-D. 

[0387] As previously described, the unit for changing the angle, i.e., the angle selection unit, is the smallest interleave 
unit obtained from the conditions defined by equations 3 and 4, and is defined as the angle interleave unit A-ILVU. A- 
15 ILVU management information is also added to the angle interleave unit A-ILVU. The navigation pack NV previously 
described corresponds to this A-ILVU management information. 

[0388] An example in which the last pack address in the current A-ILVU and the address of the next A-ILVU are. 
recorded for plural angles is shown as another embodiment of the invention in Fig. 48. While this figure is very similar 
to Fig. 42, the angle interleave unit A-ILVU comprises two video object units VOBU in this example, and the navigation 

20 pack NV of each VOBU contains the ILVU end pack address ILVU_EA indicating the address of the last pack in the 
ILVU to which the VOBU belongs, and the next-ILVU start address for each angle data block #1 - #9 
(SML_AGL_C1_DSTA - SML_AGL_C9_DSTA). These addresses are expressed as the number of sectors from the 
navigation pack NV containing the address information. In fields in which there is no angle data, information indicating 
that there is no angle data, e.g., 0, is recorded. By thus recording the last pack address and the next-ILVU start address 

25 for each angle data block #1 - #9, it is possible to know the address of each next selectable angle and to switch to a 
different angle scene without reading unnecessary angle information. 

[0389] For interleaving data in the multi-angle scene period, all angles are interleaved to the same time reference 
(interleave boundary) using the angle interleave unit A-ILVU for the shortest read time. This is to enable the angle to 
be changed as quickly as possible within the performance limits of the DVD player. Interleave unit data is temporarily 

30 stored to the track buffer, and the data for the angle selected next is then input to the track buffer. However, the next 
angle cannot be reproduced until the data for the previous angle (the angle being reproduced when the angle was 
changed) in the track buffer has been consumed. It is therefore necessary to minimize the size of the angle interleave 
unit A-ILVU in order to maximize the speed at which the next angle scene can be selected and presented. A common 
interleave unit size and boundary position must therefore be used between the VOB constituting each angle. 

35 [0390] This means that the reproduction time of the encoded video stream forming the VOB must be the same for 
each VOB, and the time at which each interleave unit can be reproduced must be the same, i.e., the interleave unit 
boundaries must be common, for each interleave unit at the same reproduction time in each angle. More specifically, 
the VOBs constituting each angle must be divided into the same number of interleave units, and the reproduction time 
of said interleave units must be the same in each angle, i.e., the VOBs constituting each angle must be divided into 

40 the same number N of interleave units, and the k-th interleave unit (where 1 < k < N) in each angle must have the same 
reproduction time. 

[0391] Moreover, to seamlessly reproduce the interleave units between each angle, the encoded stream must be 
complete within each interleave units. This means that under the MPEG standard, a closed GOP structure using a 
compression method that does not reference any frames outside the interleave unit must be used. If this method is 
45 not used, it is not possible to seamless connect and reproduce interleave units from different angles. By using this 
VOB structure and interleave unit alignment, however, contiguous reproduction is possible time-wise even if the angle 
is changed. 

[0392] The number of interleave units in the multi-angle scene period is determined by the number of interleave units 
in the other angles that can be arrayed within the jumpable distance after an interleave unit is read. Regarding the 

so arrangement of the interleaved units in each angle, the interleave units reproduced first in each angle are arrayed in 
the angle sequence, and are subsequently followed by the interleave units reproduced next in each angle being arrayed 
in the angle sequence. Thus, if the number of angles is M (where M is a natural number, and 1 < M < 9), the m-th angle 
is angle #m (where m is a natural number, and 1 < m < M), the number of interleave units is N (where N is a natural 
number greater than or equal to 1), and the n-th interleave unit in a VOB is interleave unit #n (where n is a natural 

55 number, and 1 < n < N), the interleave unit sequence is angle #1 interleave unit #1 , angle #2 interleave unit #1 , angle 
#3 interleave unit #1 , . . angle #M interleave unit #1 , angle #1 interleave unit #2, angle #2 interleave unit #2, . . . 
[0393] If the interleave unit length of each angle is the shortest read time for seamlessly selected angles where the 
angle change is seamless, the maximum distance that must be jumped when moving between angles is the distance 



EP 0 918 438 B1 



from the first angle interleave unit in the sequence to the last interleave unit in the interleave unit sequence of the next 
angle reproduced within the sequence of angle interleave units reproduced at the same time. Thus, if the number of 
angles is An, the jump distance must satisfy the following equation 7. 

5 

max. angle ILVU length x (An - 1 ) x 2 < jumpable distance [7] 

[0394] With non-seamless multi-angle switching, each angle must be seamlessly reproduced, but seamless repro- 
duction is not required when moving between angles. Therefore, if the interleave unit length of each angle is the shortest 
, ,. 10 read time, the maximum distance that must be jumped when moving between angles is the distance between the 
interleave units of each angle. Thus, if the number of angles is An, the jump distance must satisfy the following equation 
8. 

15 max. angle ILVU length x (An - 1) < jumpable distance [8] 

[0395] The method of managing addresses on the switching unit level between multiple angle data VOB in multi- 
angle scene periods is described below with reference to Figs. 49 and 50. In Fig. 49 the angle interleave unit A-ILVU 
is the data switching unit, and the address of another angle interleave unit A-ILVU is written to the navigation pack NV 
20 of each angle interleave unit A-ILVU. Fig. 49 shows the address description achieving seamless reproduction, i.e., 
uninterrupted reproduction of the audio and video data. This addressing method specifically provides for control where- 
by only the data for the interleave units of the angle to be reproduced is read into the track buffer when the angle is 
changed. 

[0396] Fig. 50 shows an example in which the video object unit VOBU is the data switching unit, and the address of 
25 another video object unit VOBU is written to the navigation pack NV of each video object unit VOBU. This addressing 
method provides for non-seamless reproduction control whereby reproduction can be changed as quickly as possible 
to another angle near the time when the angle is changed. 

[0397] In Fig. 49, each angle interleave unit A-ILVU in the three multi-angle data VOB-B, VOB-C, and VOB-D records 
as the address of the next angle interleave unit A-ILVU to be reproduced the address of a chronologically later A-ILVU. 
30 Note that VOB-B is designated angle #1, VOB-C is angle #2, and VOB-D is angle #3. The multi-angle data VOB-B 
consists of angle interleave units A-ILVUb1, A-ILVUb2, and A-ILVUb3. multi-angle data VOB-C similarly consists of 
angle interleave units A-ILVUc1, A-ILVUc2, and A-ILVUc3, and multi-angle data VOB-D of angle interleave units A- 
ILVUdl , A-ILVUd2, and A-ILVUd3. 

[0398] The navigation pack NV of angle interleave unit A-ILVUb1 contains SML_AGL_C#1_DSTA pointing to the 
35 relative address of the next angle interleave unit A-ILVUb2 in VOB-B, as shown by line Pblb; SML_AGL_C#2_DSTA 
pointing to the relative address of VOB-C angle interleave unit A-ILVUc2 synchronized to the same (next) angle inter- 
leave unit A-ILVUb2 as shown by line Pblc; and SML_AGL_C#3_DSTA pointing to the relative address of the VOB-D 
angle interleave unit A-ILVUd2 as shown by line Pb1d. 

[0399] As shown by lines Pb2b, Pb2c, and Pb2d, the navigation pack NV of the next angle interleave unit A-ILVUb2 
40 in the same video object VOB-B likewise contains SML_AGL_C#1_DSTA, SML_AGL_C#2_DSTA, and 
SML_AGL_C#3_DSTA pointing to the relative address of the next angle interleave units A-ILVUb3, A-lLVUc3, and A- 
ILVUd3 in VOB-B, VOB-C, and VOB-D, respectively. 

[0400] Note that the relative addresses are all expressed as the number of sectors from the navigation pack NV of 
the VOBU contained in each interleave unit. 
45 [0401] Addresses are similarly written to the navigation pack NV of the first angle interleave unit A-ILVUc1 in VOB- 

C, i.e., SML_AGL_C#2_DSTA pointing to the relative address of the next angle interleave unit A-ILVUc2 in VOB-C as 
shown by line Pc1c; SML_AGL_C#1_DSTA pointing to the relative address of the same (next) angle interleave unit A- 
ILVUb2 in VOB-B, as shown by line Pc1b; and SML_AGL_C#3_DSTA pointing to the relative address of the VOB-D 
angle interleave unit A-ILVUd2 as shown by line Pcld. 

so [0402] As shown by lines Pc2c, Pc2b, and Pc2d, the navigation pack NV of the next angle interleave unit A-ILVUc2 
in VOB-C likewise contains the relative addresses SML_AGL_C#2_DSTA, SML_AGL_C#1_DSTA, and 
SML__AGL_C#3_DSTA pointing to the next angle interleave units A-ILVUc3, A-ILVUb3, and A-ILVUd3 in the respective 
video objects VOB. 

[0403] Again note that the relative addresses are all expressed as the number of sectors from the navigation pack 
55 NV of the VOBU contained in each interleave unit. 

[0404] Addresses are similarly written to the navigation pack NV of the first angle interleave unit A-ILVUd1 in VOB- 

D, i.e., SML_AGL_C#3_DSTA pointing to the relative address of the next angle interleave unit A-ILVUd2 in VOB-D as 
shown by line Pdld; SML_ AGL_C#1_DSTA pointing to the relative address of the same (next) angle interleave unit 
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A-ILVUb2 in VOB-B, as shown by line Pdlb; and SML_AGL C#2 DSTA pointing to the relative address of VOB-C angle 
interleave unit A-ILVUc2 as shown by line Pd1c. 

[0405] As shown by lines Pd2d, Pd2b, and Pd2c, the navigation pack NV of the next angle interleave unit A-ILVUd2 
in VOB-D likewise contains the relative addresses SML_AGL_C#3_DSTA, SML_AGL_C#1_DSTA, and 
5 SML_AGL_C#2_DSTA pointing to the next angle interleave units A-ILVUd3, A-ILVUb3, and A-ILVUc3 in the respective 
video objects VOB. 

[0406] Again note that the relative addresses are all expressed as the number of sectors from the navigation pack 
NV of the VOBU contained in each interleave unit. 

[0407] In addition to the addresses SML_AGLJ3#1_DSTA-SML_AGL_C#9_DSTA, each navigation pack NV also 
10 contains various parameter data as previously described with respectto Fig. 20, and further description thereof is thus 
omitted for simplicity. 

[0408] This address information more specifically includes in the navigation pack NV of angle interleave unit A- 
ILVUbl , for example, the end address ILVU_EA of the angle interleave unit A-ILVUb1 to which the navigation pack NV 
belongs, and the addresses SML_AGL_C#1_DSTA, SML_AGL_C#2_DSTA, and SML_AGL_C#3_DSTA of the navi- 

15 gation packs NV of the next angle interleave units A-ILVUb2, A-ILVUc2, A-ILVUd2 that can be reproduced. The navi- 
gation pack NV of A-ILVUb2 contains the end address ILVU_EA of A-ILVUb2 J and the addresses 
SML_AGL_C#1_DSTA, SML_AGL_C#2_DSTA, and SML_AGL_C#3_DSTA of the navigation packs NV of the next 
angle interleave units A-ILVUb3, A-ILVUc3, A-ILVUd3 reproduced. The navigation pack NV of A-ILVUb3, the last in- 
terleave unit in the sequence in this example, contains the end address ILVU_EA of A-lLVUb3, and termination infor- 

20 mation as the address of the navigation pack NV of the next A-ILVU reproduced, e.g., a NULL value or string of all 1s 
as the ILVILEA. 

[0409] The specifics of the address information written to the other video objects VOB-C and VOB-D in this example 
are the same. 

[0410] By writing this address information to the navigation packs NV of each angle interleave unit A-ILVU, it is 
25 possible to read ahead the address of the next A-ILVU to be reproduced in chronological sequence, and is therefore 
suited to seamless reproduction. Furthermore, because the address of the next A-ILVU in each of the other available 
angles is also recorded to each navigation pack NV, the next address of the selected angle can be easily obtained 
without giving special consideration to whether the angle is changed or not, and jumping to the next interleave unit can 
be controlled by the same control sequence. 
30 [041 1 ] By thus recording the relative addresses of each A-ILVU that can be selected from each angle, and construct- 
ing the encoded video stream contained in each A-ILVU from closed GOP, seamless reproduction without video dis- 
ruption can be achieved when the viewing angle is changed. 

[0412] The audio data can also be seamlessly reproduced if the same audio is recorded for each angle, if the audio 
is complete within each ILVU, or if discrete audio data is recorded. Moreover, if identical audio data is recorded to each 
35 interleaved unit ILVU, the listener will not even be able to discern that the audio track has changed if the audio is 
seamlessly reproduced across angle changes. 

[0413] A data structure enabling non-seamless reproduction of angle changes, i.e., seamless data reproduction 
permitting perceptible breaks in the content of the reproduced information, is described below with reference to Fig. 50. 
[0414] it is assumed in Fig. 50 that multi-angle data VOB-B comprises three video object units VOBUbl , VOBUb2, 

40 and VOBUb3. Multi -angle data VOB-C similarly comprises three video object units VOBUd , VOBUc2, and VOBUc3, 
and VOB-D comprises three video object units VOBUdl , VOBUd2, and VOBUd3. As in the case shown in Fig 49, the 
navigation pack NV of each video object unit VOBU contains the end pack address VOBU_EA of each VOBU. Note 
that this end pack address VOBU_EA is the address of a navigation pack NV in a VOBU comprising a navigation pack 
N V and one or more other packs. In this example, however, the NSML_AGL_C#_DSTA address written to the navigation 

45 pack NV of each VOBU does not describe a chronologically later VOBU address, but the address of a VOBU in a 
different angle with a reproduction time preceding the time the angle is changed. 

[0415] In other words, addresses N S M L_ A G L_C# 1 _D STA to N S M L_AG L_C#9_D STA to VOBU in other angles syn- 
chronized to the current VOBU are recorded. As above, #1 - #9 represent different angle numbers. For fields in which 
an angle of the corresponding number does not exist, a value, e.g., 0, indicating that the angle does not exist is recorded. 
50 Thus, the navigation pack NV of video object unit VOBUbl of multi-angle data VOB-B contains NSML AGL_C#2_DSTA 
and NSML_AGL_C#3_DSTA, the relative addresses of the synchronized VOBUd and VOBUdl in VOB-C and VOB- 
D as shown by lines Pb1c* and Pb1d'. 

[0416] The navigation pack NV of video object unit VOBUb2 similarly contains as shown by lines Pb2c' and Pb2d' 
the relative addresses N S M L_AG L_C#2_DSTA and N SM L_AGL_C#3_DSTA of the synchronized VOBUc2 and 
55 VOBUd2. The navigation pack NV of video object unit VOBUb3 similarly contains as shown by lines Pb3c' and Pb3d' 
the relative addresses NSM L_AG L_C#2_DSTA and NSML_AGL_C#3_DSTA of the synchronized VOBUc3 and 
VOBUd3. 

[0417] The navigation packs NV of video object units VOBUd , VOBUc2, and VOBUc3 of VOB-C similarly contain 
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the relative addresses N S M L_AG L_C# 1 _DSTA and NSML_AGL_C#3_DSTA of the respectively synchronized VOBU 
as shown by lines Pdb', Pdd', Pc2b\ Pc2d', Pc3b\ and Pc3d'. 

[0418] The navigation packs NV of video object units VOBUdl , VOBUd2, and VOBUd3 of VOB-D likewise contain 
the relative addresses NSML_AGL_C#1_DSTA and NSML_AGL_C#2_DSTA of the respectively synchronized VOBU 

5 as shown by lines Pd1b', Pd1c\ Pd2b\ Pd2c\ Pd3b', and Pd3c'. 

[0419] The angle selection address information NSML_AGL_C#4_DSTA to NSML_AGL_C#9_DSTA for non-existent 
angles #4 - #9 in this example record a value indicating that corresponding angles do not exist, e.g., a 0 value. 
[0420] During reproduction of angle data with this data structure, the DVD decoder interrupts reproduction of the 
angle VOBU data currently being reproduced when the angle is changed, then reads and begins reproducing the VOBU 

10 data for the selected angle. 

[0421] Note that while there appears to be a time offset between the interleaved units of VOB-C relative to VOB-B 
and VOB-D in Fig. 50, this is merely to facilitate easier understanding of the address descriptions written to the respec- 
tive navigation packs NV of each VOB using the fine arrows as shown. There is, in fact, no time shifting between the 
respective interleaved video objects VOB, which are aligned as shown in Fig. 49. 

15 [0422] The data structure shown in Fig. 50 thus contains as pointers to the next VOBU to be reproduced addresses 
to another VOBU that should normally be reproduced either simultaneously with or chronologically before the VOBU 
containing the address information. Therefore, when the angle is changed, reproduction is continued from a scene at 
a past point in time. This method of encoding the address information is thus preferable when seamless angle switching 
not required, i.e., for non-seamless information reproduction in which contiguity is not required in the reproduced in- 

20 formation. 

Flow chart: encoder 

[0423] The encoding information table generated by the encoding system controller 200 from information extracted 

25 from the scenario data St7 is described below referring to Fig. 27. 

[0424] The encoding information table contains VOB set data streams containing plural VOB corresponding to the 
scene periods beginning and ending at the scene branching and connecting points, and VOB data streams correspond- 
ing to each scene. These VOB set data streams shown in Fig. 27 are the encoding information tables generated at 
step #1 00 in Fig. 51 by the encoding system controller 200 for creating the DVD multimedia stream based on the user- 

30 defined title content. 

[0425] The user-defined scenario contains branching points from common scenes to plural scenes, or connection 
points to other common scenes. The VOB corresponding to the scene period delimited by these branching and con- 
necting points is a VOB set, and the data generated to encode a VOB set is the VOB set data stream. The title number 
specified by the VOB set data stream is the title number TITLE_NO of the VOB set data stream. 

35 [0426] The VOB Set data structure in Fig. 27 shows the data content for encoding one VOB set in the VOB set data 
stream, and comprises: the VOB set number VOBS_NO, the VOB number VOB_NO in the VOB set, the preceding 
VOB seamless connection flag VOB_Fsb, the following VOB seamless connection flag VOB_Fsf, the multi-scene flag 
VOB_Fp, the interleave flag VOB_Fi, the multi-angle flag VOB_Fm, the multi-angle seamless switching flag VOB_FsV, 
the maximum bit rate of the interleaved VOB ILV_BR, the number of interleaved VOB divisions ILV_DIV, and the min- 

40 imum interleaved unit presentation time ILVU_MT. 

[0427] The VOB set number VOBSJvIO is a sequential number identifying the VOB set and the position of the VOB 
set in the reproduction sequence of the title scenario. 

[0428] The VOB number VOB_NO is a sequential number identifying the VOB and the position of the VOB in the 
reproduction sequence of the title scenario. 
45 [0429] The preceding VOB seamless connection flag VOB_Fsb indicates whether a seamless connection with the 
preceding VOB is required for scenario reproduction. 

[0430] The following VOB seamless connection flag VOB_Fsf indicates whether there is a seamless connection with 
the following VOB during scenario reproduction. 

[0431] The multi-scene flag VOB_Fp identifies whether the VOB set comprises plural video objects VOB. 
so [0432] The interleave flag VOB_Fi identifies whether the VOB in the VOB set are interleaved. 
[0433] The multi-angle flag VOB_Fm identifies whether the VOB set is a multi-angle set. 

[0434] The multi-angle seamless switching flag VOB_FsV identifies whether angle changes within the multi-angle 
scene period are seamless or not. 

[0435] The maximum bit rate of the interleaved VOB ILVJ3R defines the maximum bit rate of the interleaved VOBs. 
55 [0436] The number of interleaved VOB divisions ILV_DIV identifies the number of interleave units in the interleaved 
VOB. 

[0437] The minimum interleave unit presentation time ILVU_MT defines the time that can be reproduced when the 
bit rate of the smallest interleave unit at which a track buffer data underflow state does not occur is the maximum bit 
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rate of the interleaved VOB ILV_BR during interleaved block reproduction. 

[0438] The encoding information table for each VOB generated by the encoding system controller 200 based on the 
scenario data St7 is described below referring to Fig. 28. The VOB encoding parameters described below and supplied 
to the video encoder 300, audio encoder 700, and system encoder 900 for stream encoding are produced based on 
this encoding information table. 

[0439] The VOB data streams shown in Fig. 28 are the encoding information tables generated at step #100 in Fig. 
51 by the encoding system controller 200 for creating the DVD multimedia stream based on the user-defined title 
content. 

[0440] The encoding unit is the video object VOB, and the data generated to encode each video object VOB is the 
VOB data stream. For example, a VOB set comprising three angle scenes comprises three video objects VOB. The 
data structure shown in Fig. 28 shows the content of the data for encoding one VOB in the VOB data stream. 
[0441] The VOB data structure contains the video material start time VOB_VST, the video material end time 
VOB_VEND, the video signal type VOB_V_KIND, the video encoding bit rate V_BR, the audio material start time 
VOB_AST, the audio material end time VOB_AEND, the audio coding method VOB_A_KIND, and the audio encoding 
bit rate A_BR. 

[0442] The video material start time VOB_VST is the video encoding start time corresponding to the time of the video 
signal. 

[0443] The video material end time VOB_VEND is the video encoding end time corresponding to the time of the 
video signal. 

[0444] The video material type VOB_V_KIND identifies whether the encoded material is in the NTSC or PAL format, 
for example, or is photographic material (a movie, for example) converted to a television broadcast format (so-called 
telecine conversion). 

[0445] The video encoding bit rate VJ3R is the bit rate at which the video signal is encoded. 

[0446] The audio material start time VOB_AST is the audio encoding start time corresponding to the time of the audio 

signal. 

[0447] The audio material end time VOB_AEND is the audio encoding end time corresponding to the time of the 
audio signal. 

[0448] The audio coding method VOB_A_KIND identifies the audio encoding method as AC-3, MPEG, or linear PCM, 
for example. 

[0449] The audio encoding bit rate A_BR is the bit rate at which the audio signal is encoded. 
[0450] The encoding parameters used by the video encoder 300, sub-picture encoder 500, and audio encoder 700, 
and system encoder 900 for VOB encoding are shown in Fig. 29. The encoding parameters include: the VOB number 
VOB_NO, video encode start time V_STTM, video encode end time V_ENDTM, the video encode mode V_ENCMD, 
the video encode bit rate V RATE, the maximum video encode bit rate V_MRATE, the GOP structure fixing flag 
GOP_Fxflag, the video encode GOP structure GOPST, the initial video encode data VJNTST, the last video encode 
data V_ENDST, the audio encode start time A_STTM, the audio encode end time A_ENDTM, the audio encode bit 
rate A_RATE, the audio encode method A_ENCMD, the audio start gap A_STGAP, the audio end gap A_ENDGAP, 
the preceding VOB number B_VOB_NO, and the following VOB number F_VOB_NO. 

[0451] The VOB number VOB_NO is a sequential number identifying the VOB and the position of the VOB in the 
reproduction sequence of the title scenario. 

[0452] The video encode start time V_STTM is the start time of video material encoding. 
[0453] The video encode end time V_ENDTM is the end time of video material encoding. 

[0454] The video encode mode V_ENCMD is an encoding mode for declaring whether reverse telecine conversion 
shall be accomplished during video encoding to enable efficient coding when the video material is telecine converted 
material. 

[0455] The video encode bit rate V_RATE is the average bit rate of video encoding. 

[0456] The maximum video encode bit rate V_MRATE is the maximum bit rate of video encoding. 

[0457] The GOP structure fixing flag GOP_Fxflag specifies whether encoding is accomplished without changing the 

GOP structure in the middle of the video encoding process. This is a useful parameter for declaring whether seamless 

switch is enabled in a multi-angle scene period. 

[0458] The video encode GOP structure GOPST is the GOP structure data from encoding. 

[0459] The initial video encode data VJNTST sets the initial value of the VBV buffer (decoder buffer) at the start of 
video encoding, and is referenced during video decoding to initialize the decoding buffer. This is a useful parameter 
for declaring seamless reproduction with the preceding encoded video stream. 

[0460] The last video encode data V_ENDST sets the end value of the VBV buffer (decoder buffer) at the end of 
video encoding, and is referenced during video decoding to initialize the decoding buffer. This is a useful parameter 
for declaring seamless reproduction with the preceding encoded video stream. 
[0461] The audio encode start time A_STTM is the start time of audio material encoding. 
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[0462] The audio encode end time A_ENDTM is the end time of audio material encoding. 
[0463] The audio encode bit rate A_RATE is the bit rate used for audio encoding. 

[0464] The audio encode method A_ENCMD identifies the audio encoding method as AC-3, MPEG, or linear PCM, 
for example. 

5 [0465] The audio start gap A_STGAP is the time offset between the start of the audio and video presentation at the 
beginning of a VOB. This is a useful parameter for declaring seamless reproduction with the preceding encoded system 
stream. 

[0466] The audio end gap A_ENDGAP is the time offset between the end of the audio and video presentation at the 
end of a VOB. This is a useful parameter for declaring seamless reproduction with the preceding encoded system 
10 mt „ stream. 

[0467] The preceding VOB number B_VOB_NO is the VOB_NO of the preceding VOB when there is a seamlessly 
connected preceding VOB. 

[0468] The following VOB number F_VOB_NO is the VOB_NO of the following VOB when there is a seamlessly 
connected following VOB. 

15 [0469] The operation of a DVD encoder ECD according to the present invention is described below with reference 
to the flow chart in Fig. 51 . Note that the steps shown with a double line are subroutines. It should be obvious that 
while the operation described below relates specifically in this case to the DVD encoder ECD of the present invention, 
the operation described also applies to an authoring encoder EC. 

[0470] At step #100, the user inputs the editing commands according to the user-defined scenario while confirming 
20 the content of the multimedia source data streams St1 , St2, and St3. 

[0471] At step #200, the scenario editor 100 generates the scenario data St7 containing the above edit command 
information according to the user's editing instructions. 

[0472] When generating the scenario data St7 in step #200, the user editing commands related to multi-angle and 
parental lock multi-scene periods in which interleaving is presumed must be input to satisfy the following conditions. 

25 [0473] First, the VOB maximum bit rate must be set to assure sufficient image quality, and the track buffer capacity, 
jump performance, jump time, and jump distance of the DVD decoder DCD used as the reproduction apparatus of the 
DVD encoded data must be determined. Based on these values, the reproduction time of the shortest interleaved unit 
is obtained from equations 3 and 4. Based on the reproduction time of each scene in the multi-scene period, it must 
then be determined whether equations 5 and 6 are satisfied. If equations 5 and 6 are not satisfied, the user must 

30 change the edit commands until equations 5 and 6 are satisfied by, for example, connecting part of the following scene 
to each scene in the multi-scene period. 

[0474] When multi-angle edit commands are used, equation 7 must be satisfied for seamless switching, and edit 
commands matching the audio reproduction time with the reproduction time of each scene in each angle must be 
entered. If non-seamless switching is used, the user must enter commands to satisfy equation 8. 
35 [0475] At step #300, the encoding system controller 200 first determines whether the target scene is to be seamlessly 
connected to the preceding scene based on the scenario data St7. 

[0476] Note that when the preceding scene period is a multi-scene period comprising plural scenes but the presently 
selected target scene is a common scene (not in a multi-scene period), a seamless connection refers to seamlessly 
connecting the target scene with any one of the scenes contained in the preceding multi-scene period. When the target 
40 scene is a multi-scene period, a seamless connection still refers to seamlessly connecting the target scene with any 
one of the scenes from the same multi-scene period. 

[0477] If step #300 returns NO, i.e., a non-seamless connection is valid, the procedure moves to step #400. 
[0478] At step #400, the encoding system controller 200 resets the preceding VOB seamless connection flag 
VOB_Fsb indicating whether there is a seamless connection between the target and preceding scenes. The procedure 
45 then moves to step #600. 

[0479] On the other hand, if step #300 returns YES, i.e., there is a seamless connection to the preceding scene, the 
procedure moves to step #500. 

[0480] At step #500 the encoding system controller 200 sets the preceding VOB seamless connection flag VOB Fsb. 
The procedure then moves to step #600. 
so [0481] At step #600 the encoding system controller 200 determines whether there is a seamless connection between 
the target and following scenes based on scenario data St7. If step #600 returns NO, i.e., a non-seamless connection 
is valid, the procedure moves to step #700. 

[0482] At step #700, the encoding system controller 200 resets the following VOB seamless connection flag VOB_Fsf 
indicating whether there is a seamless connection with the following scene. The procedure then moves to step #900. 
55 [0483] However, if step #600 returns YES, i.e., there is a seamless connection to the following scene, the procedure 
moves to step #800. 

[0484] At step #800 the encoding system controller 200 sets the following VOB seamless connection flag VOB_Fsf . 
The procedure then moves to step #900. 
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[0485] At step #900 the encoding system controller 200 determines whether there is more than connection target 
scene, i.e., whether a multi-scene period is selected, based on the scenario data St7. As previously described, there 
are two possible control methods in multi-scene periods: parental lock control whereby only one of plural possible 
reproduction paths that can be constructed from the scenes in the multi-scene period is reproduced, and multi-angle 

s control whereby the reproduction path can be switched within the multi-scene period to present different viewing angles. 
[0486] If step #900 returns NO, i.e., there are not multiple scenes, the procedure moves to step #1 000. 
[0487] At step #1 000 the multi-scene flag VOB_Fp identifying whether the VOB set comprises plural video objects 
VOB (a multi-scene period is selected) is reset, and the procedure moves to step #1800 for encode parameter pro- 
duction. This encode parameter production subroutine is described below. 

10 [0488] However, if step #900 returns YES, there is a multi-sceneconnection, the procedure moves to step #11 00. 
[0489] At step #1100, the multi-scene flag VOB_Fp is set, and the procedure moves to step #1200 whereat it is 
judged whether a multi-angle connection is selected, or not. 

[0490] At step #1200 it is determined whether a change is made between plural scenes in the multi-scene period, i. 
e., whether a multi-angle scene period is selected. If step #1200 returns NO, i.e., no scene change is allowed in the 
15 multi-scene period as parental lock control reproducing only one reproduction path has been selected, the procedure 
moves to step #1300. 

[0491] At step #1 300 the multi-angle flag VOB_Fm identifying whether the target connection scene is a multi-angle 
scene is reset, and the procedure moves to step #1302. 

[0492] At step #1302 it is determined whether either the preceding VOB seamless connection flag VOB_Fsb or 
20 following VOB seamless connection flag VOB_Fsf is set. If step #1302 returns YES, i.e., the target connection scene 
seamlessly connects to the preceding, the following, or both the preceding and following scenes, the procedure moves 
to step #1304. 

[0493] At step #1 304 the interleave flag VOB_Fi identifying whether the VOB, the encoded data of the target scene, 
is interleaved is set. The procedure then moves to step #1800. 
25 [0494] However, if step #1302 returns NO, i.e., the target connection scene does not seamlessly connect to the 
preceding or following scene, the procedure moves to step #1306. 

[0495] At step #1 306 the interleave flag VOB_Fi is reset, and the procedure moves to step #1 800. 

[0496] If step #1200 returns YES, however, i. e., there is a multi-angle connection, the procedure moves to step 

#1400. 

30 [0497] At step #1400, the multi-angle flag VOB_Fm and interleave flag VOB_Fi are set, and the procedure moves 
to step #1500. 

[0498] At step #1 500 the encoding system controller 200 determines whether the audio and video can be seamlessly 
switched in a multi-angle scene period, i.e., at a reproduction unit smaller than the VOB, based on the scenario data 
St7. If step #1500 returns NO, i.e., non-seamless switching occurs, the procedure moves to step #1600. 
35 [0499] At step #1 600 the multi-angle seamless switching flag VOB_FsV indicating whether angle changes within the 
multi-angle scene period are seamless or not is reset, and the procedure moves to step #1800. 
[0500] However, if step #1500 returns YES, i.e., seamless switching occurs, the procedure moves to step #1 700. 
[0501] At step #1700 the multi-angle seamless switching flag VOB_FsV is set, and the procedure moves to step 
#1800. 

40 [0502] Therefore, as shown by the flow chart in Fig. 51 , encode parameter production (step #1800) is only begun 
after the editing information is detected from the above flag settings in the scenario data St7 reflecting the user-defined 
editing instructions. 

[0503] Based on the user-defined editing instructions detected from the above flag settings in the scenario data St7, 
information is added to the encoding information tables for the VOB Set units and VOB units as shown in Figs. 27 and 
45 28 to encode the source streams, and the encoding parameters of the VOB data units shown in Fig. 29 are produced, 
in step #1800. The procedure then moves to step #1900 for audio and video encoding. 

[0504] The encode parameter production steps (step #1 800) are described in greater detail below referring to Figs. 
52, 53, 54, and 55. 

[0505] Based on the encode parameters produced in step #1 800, the video data and audio data are encoded in step 

so #1 900, and the procedure moves to step #2000. 

[0506] Note that the sub-picture data is normally inserted during video reproduction on an as-needed basis, and 
contiguity with the preceding and following scenes is therefore not usually necessary. Moreover, the sub-picture data 
is normally video information for one frame, and unlike audio and video data having an extended time-base, sub-picture 
data is usually static, and is not normally presented continuously. Because the present invention relates specifically to 

55 seamless and non-seamless contiguous reproduction as described above, description of sub-picture data encoding is 
omitted herein for simplicity. 

[0507] Step #2000 is the last step in a loop comprising steps #300 to step #2000, and causes this loop to be repeated 
as many times as there are VOB Sets. This loop formats the program chain VTS_PGC#i to contain the reproduction 
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sequence and other reproduction information for each VOB in the title (Fig. 16) in the program chain data structure, 
interleaves the VOB in the multi-scene periods, and completes the VOB Set data stream and VOB data stream needed 
for system stream encoding. The procedure then moves to step #2100. 

[0508] At step #2100 the VOB Set data stream is completed as the encoding information table by adding the total 
5 number of VOB Sets VOBSJMUM obtained as a result of the loop through step #2000 to the VOB Set data stream, 
and setting the number of titles TITLE_NO defining the number of scenario reproduction paths in the scenario data 
St7. The procedure then moves to step #2200. 

[0509] System stream encoding producing the VOB (VOB#i) data in the VTS title VOBS (VTSTT_VOBS) (Fig. 16) 
is accomplished in step #2200 based on the encoded video stream and encoded audio stream output from step #1900, 

10 and the encode parameters in Fig. 29. The procedure then moves to step #2300. 

[0510] At step #2300 the VTS information VTSI, VTSI management table VTSLMAT, VTSPGC information table 
VTS_PGCIT, and the program chain information VTS_PGCI#i controlling the VOB data reproduction sequence shown 
in Fig. 16 are produced, and formatting to, for example, interleave the VOB contained in the multi-scene periods, is 
accomplished. The specific steps executed in this formatting operation are described below with reference to Figs. 56, 

15 57, 58, 59, and 60. 

[051 1 ] The encode parameter production subroutine shown as step #1 800 in Fig. 51 B is described next using Figs. 
52, 53, and 54 using by way of example the operation generating the encode parameters for multi-angle control. 
[0512] Starting from Fig. 52, the process for generating the encode parameters of a non-seamless switching stream 
with multi-angle control is described first. This stream is generated when step #1500 in Fig. 51 returns NO and the 
20 following flags are set as shown: VOB_Fsb = 1 or VOB_Fsf = 1 , VOB_Fp = 1 , VOB_Fi = 1 , VOB_Fm = 1 , and VOB_FsV 
= 0. The following operation produces the encoding information tables shown in Fig. 27 and Fig. 28, and the encode 
parameters shown in Fig. 29. 

[0513] At step #1812, the scenario reproduction sequence (path) contained in the scenario data St7 is extracted, 
the VOB Set number VOBS_NO is set, and the VOB number VOB_NO is set for one or more VOB in the VOB Set. 
25 [0514] At step #1814 the maximum bit rate ILV_BR of the interleaved VOB is extracted from the scenario data St7, 
and the maximum video encode bit rate VJMRATE from the encode parameters is set based on the interleave flag 
VOB_Fi setting (=1). 

[0515] At step #1816, the minimum interleaved unit presentation time ILVU_MT is extracted from the scenario data 
St7. 

30 [0516] At step #1818, the video encode GOP structure GOPST values N = 15 and M = 3 are set, and the GOP 
structure fixing flag GOP_Fxflag is set (= 1), based on the multi-scene flag VOB_Fp setting (= 1). 
[0517] Step #1820 is the common VOB data setting routine, which is described below referring to the flow chart in 
Fig. 53. This common VOB data setting routine produces the encoding information tables shown in Figs. 27 and 28, 
and the encode parameters shown in Fig. 29. 

35 [0518] At step #1 822 the video material start time VOB_VST and video material end time VOB_VEND are extracted 
for each VOB, and the video encode start time V_STTM and video encode end time V_ENDTM are used as video 
encoding parameters. 

[0519] At step #1824 the audio material start time VOB_AST of each VOB is extracted from the scenario data St7, 
and the audio encode start time A_STTM is set as an audio encoding parameter. 
40 [0520] At step #1 826 the audio material end time VOB„AEND is extracted for each VOB from the scenario data St7, 
and at a time not exceeding the VOB_AEND time. This time extracted at an audio access unit (AAU) is set as the audio 
encode end time A_ENDTM which is an audio encoding parameter. Note that the audio access unit AAU is determined 
by the audio encoding method. 

[0521] At step #1828 the audio start gap A_STGAP obtained from the difference between the video encode start 
45 time V_STTM and the audio encode start time A_STTM is defined as a system encode parameter. 

[0522] At step #1830 the audio end gap A_ENDGAP obtained from the difference between the video encode end 
time V_ENDTM and the audio encode end time A_ENDTM is defined as a system encode parameter. 
[0523] At step #1 832 the video encoding bit rate V_BR is extracted from the scenario data St7, and the video encode 
bit rate V_RATE, which is the average bit rate of video encoding, is set as a video encoding parameter. 
50 [0524] At step #1 834 the audio encoding bit rate A_BR is extracted from the scenario data St7, and the audio encode 
bit rate A_RATE is set as an audio encoding parameter. 

[0525] At step #1836 the video material type VOB_V_KIND is extracted from the scenario data St7. If the material 
is a film type, i.e., a movie converted to television broadcast format (so-called telecine conversion), reverse telecine 
conversion is set for the video encode mode V_ENCMD, and defined as a video encoding parameter. 
55 [0526] At step #1838 the audio coding method VOB_A_KIND is extracted from the scenario data St7, and the en- 
coding method is set as the audio encode method A ENCMD and set as an audio encoding parameter. 
[0527] At step #1840 the initial video encode data VJNTST sets the initial value of the VBV buffer to a value less 
than the VBV buffer end value set by the last video encode data V_ENDST, and defined as a video encoding parameter. 
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[0528] At step #1 842 the VOB number VOBJMO of the preceding connection is set to the preceding VOB number 
B_VOB_NO based on the setting (= 1 ) of the preceding VOB seamless connection flag VOB_Fsb, and set as a system 
encode parameter. 

[0529] At step #1844 the VOB number VOB_NO of the following connection is set to the following VOB number 
5 F_VOB_NO based on the setting (= 1 ) of the following VOB seamless connection flag VOB_Fsf , and set as a system 
encode parameter. 

[0530] The encoding information table and encode parameters are thus generated for a multi-angle VOB Set with 
non-seamless multi-angle switching control enabled. 

[0531] The process for generating the encode parameters of a seamless switching stream with multi-angle control 
w is described below with reference to Fig. 54. This stream is generated when step #1500 in Fig. 51 returns YES and 
the following flags are set as shown: VOB_Fsb = 1 or VOB_Fsf = 1, VOB_Fp = 1, VOB_Fi = 1, VOB_Fm = 1, and 
VOB_FsV = 1. The following operation produces the encoding information tables shown in Fig. 27 and Fig. 28, and 
the encode parameters shown in Fig. 29. 

[0532] The following operation produces the encoding information tables shown in Fig. 27 and Fig. 28, and the 

15 encode parameters shown in Fig. 29. 

[0533] At step #1850, the scenario reproduction sequence (path) contained in the scenario data St7 is extracted, 
the VOB Set number VOBS_NO is set, and the VOB number VOB_NO is set for one or more VOB in the VOB Set. 
[0534] At step #1 852 the maximum bit rate ILV_BR of the interleaved VOB is extracted from the scenario data St7, 
and the maximum video encode bit rate VJvlRATE from the encode parameters is set based on the interleave flag 

20 VOB_Fi setting (=1). 

[0535] At step #1 854, the minimum interleaved unit presentation time ILVU_MT is extracted from the scenario data 
St7. 

[0536] At step #1856, the video encode GOP structure GOPST values N = 15 and M = 3 are set, and the GOP 
structure fixing flag GOP_Fxflag is set (= 1), based on the multi-scene flag VOB_Fp setting (= 1). 
25 [0537] At step #1858, the video encode GOP GOPST is set to "closed GOP" based on the multi-angle seamless 
switching flag VOB_FsV setting (= 1), and the video encoding parameters are thus defined. 

[0538] Step #1 860 is the common VOB data setting routine, which is as described referring to the flow chart in Fig. 
52. Further description thereof is thus omitted here. 

[0539] The encode parameters of a seamless switching stream with multi-angle control are thus defined for a VOB 

30 Set with multi-angle control as described above. 

[0540] The process for generating the encode parameters for a system stream in which parental lock control is 
implemented is described below with reference to Fig. 55. This stream is generated when step #1200 in Fig. 51 returns 
NO and step #1304 returns YES, i.e., the following flags are set as shown: VOB_Fsb = 1 or VOB_Fsf = 1 , VOB_Fp = 
1 , VOB_Fi = 1 , VOB_Fm = 0. The following operation produces the encoding information tables shown in Fig. 27 and 

35 Fig. 28, and the encode parameters shown in Fig. 29. 

[0541] At step #1870, the scenario reproduction sequence (path) contained in the scenario data St7 is extracted, 
the VOB Set number VOBS_NO is set, and the VOB number VOB_NO is set for one or more VOB in the VOB Set. 
[0542] At step #1 872 the maximum bit rate ILV_BR of the interleaved VOB is extracted from the scenario data St7, 
and the maximum video encode bit rate V_MRATE from the encode parameters is set based on the interleave flag 

40 VOB_Fi setting (= 1). 

[0543] At step #1 872 the number of interleaved VOB divisions 1LV_DI V is extracted from the scenario data St7. 
[0544] Step #1 876 is the common VOB data setting routine, which is as described referring to the flow chart in Fig. 
52. Further description thereof is thus omitted here. 

[0545] The encode parameters of a system stream in which parental lock control is implemented are thus defined 
45 for a VOB Set with multi-scene selection control enabled as described above. 

[0546] The process for generating the encode parameters for a system stream containing a single scene is described 
below with reference to Fig. 61 . This stream is generated when step #900 in Fig. 51 returns NO, i.e., when VOB_Fp=0. 
The following operation produces the encoding information tables shown in Fig. 27 and Fig. 28, and the encode pa- 
rameters shown in Fig. 29. 

so [0547] At step #1880, the scenario reproduction sequence (path) contained in the scenario data St7 is extracted, 
the VOB Set number VOBS_NO is set, and the VOB number VOB_NO is set for one or more VOB in the VOB Set. 
[0548] At step #1 882 the maximum bit rate ILV_BR of the interleaved VOB is extracted from the scenario data St7, 
and the maximum video encode bit rate V_MRATE from the encode parameters is set based on the interleave flag 
VOB_Fi setting (= 1). 

55 [0549] Step #1 884 is the common VOB data setting routine, which is as described referring to the flow chart in Fig. 
52. Further description thereof is thus omitted here. 

[0550] These flow charts for defining the encoding information table and encode parameters thus generate the pa- 
rameters for DVD video, audio, and system stream encoding by the DVD formatter. 
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Formatter flows 

[0551] The operation of the subroutine executed by the DVD formatter shown as step #2300 in Fig. 51 B is described 
next with reference to Figs. 56, 57, 58, 59, and 60. This formatter subroutine generates the DVD multimedia bitstream. 

5 [0552] The operation of the DVD encoder ECD 1 1 00 according to the present invention is described with reference 
to the flow chart in Fig. 56. Note that those steps shown in Fig. 56 with a double line are subroutines. 
[0553] At step #231 0 the program chain information VTS_PGCI is set to the VTSI management table VTSI_MAT for 
the number of titles TITLE_NUM based on the number of titles TITLE_NUM in the VOB Set data stream, 
[0554] At step #231 2 it is determined whether multi-scene selection control is enabled based on the multi-scene flag 

10 VOB_Fp in the VOB Set data stream. If step #2312 returns NO, i.e., multi-scene control is not enabled, the procedure 
moves to step #2114. 

[0555] At step #231 4 the operation for coding a single scene (VOB) executed by the formatter 1100 of the authoring 
encoder EC shown in Fig. 25 is accomplished. This routine is described later. 

[0556] If step #2312 returns YES, i.e., multi-scene control is enabled, the procedure moves to step #211 6. 
15 [0557] At step #231 6 it is determined whether the information is to be interleaved or not based on the interleave flag 
VOB_Fi state in the VOB Set data stream. If step #2316 returns NO, i.e., the information is not to be interleaved, the 
procedure moves to step #2314. If step #2316 returns YES, i.e., the information is to be interleaved, the procedure 
moves to step #2318. 

[0558] At step #231 8 it is determined whether multi-angle control is to be implemented based on the multi-angle flag 
20 VOB_Fm in the VOB Set data stream. If step #2318 returns NO, the parental lock control routine in step #2320 is 
executed. If step #2318 returns YES, the procedure moves to step #2322. 

[0559] At step #2320 the operation for formatting the VOB Set for parental lock control is executed. This subroutine 
is shown in Fig. 59 and described below. 

[0560] At step #2322 it is determined whether multi-angle seamless switching is required based on the multi-angle 
25 seamless switching flag VOB_FsV. If multi-angle switching is accomplished without seamless switching, i.e., with non- 
seamless switching and step #2322 returns NO, the procedure moves to step #2326. 

[0561] The multi-angle non-seamless switching control routine executed in step #2326 by the formatter 1100 of the 
authoring encoder EC in Fig. 25 is described later with reference to Fig. 57. 

[0562] If multi-angle switching is accomplished with seamless switching control, i.e., step #2322 returns YES, the 
30 procedure moves to step #2324. 

[0563] The multi-angle seamless switching control routine executed in step #2324 by the formatter 1100 of the au- 
thoring encoder EC in Fig. 25 is described later with reference to Fig. 58. 

[0564] The cell playback information (PCG information entries C_PBI) of the VTS information VTSI set as previously 
described is then recorded. 

35 [0565] At step #2330 it is determined whether all VOB Sets declared by the VOB Set number VOBS_NUM have 
been processed by the formatter. If NO, control loops back to step #2312, and the process runs again. If YES, all sets 
have been formatted, the procedure terminates. 

[0566] Referring to Fig. 57, the multi-angle non-seamless switching control routine executed in step #2326 when 
step #2322, Fig. 56, returns NO is described. This routine defines the interleaved arrangement of the multimedia bit- 
40 stream MBS, the content of the cell playback information (C_PBI#i) shown in Fig. 1 6, and the information stored to the 
navigation pack NV shown in Fig, 20, in the generated DVD multimedia bitstream MBS. 

[0567] At step #2340 based on the multi-angle flag VOB Fm setting {= 1) declaring whether multi-angle control is 
applied in the multi-scene period, the cell block mode CBM (Fig. 16) of the cell playback information blocks C_PBI #i 
containing the VOB control information for each scene is declared according to the position of the angle data. For 
45 example, the cell block mode CBM of the MA1 cell (Fig. 23) is declared as 01b to indicate the beginning of the cell 
block, the CBM of MA2 is declared as 1 0b to indicate a cell between the first and last cells in the block, and the CBM 
of MA3 is declared as 11b to indicate the end of the cell block. 

[0568] At step #2342 based on the multi-angle flag VOB_Fm setting (= 1) declaring whether multi-angle control is 
applied in the multi-scene period, the cell block type CBT (Fig. 16) of the cell playback information blocks C_PBI #i 
so containing the VOB control information for each scene is declared as 01b to indicate an "angle." 

[0569] At step #2344 the seamless playback flag SPF (Fig. 16) is set to 1 in the cell playback information blocks 
C_PBI #i containing the VOB control information for each scene based on the preceding VOB seamless connection 
flag VOB__Fsb state, which is set to 1 to indicate a seamless connection. 

[0570] At step #2346 the STC resetting flag STCDF is set to 1 in the cell playback information blocks C_PBI #i 
55 containing the VOB control information for each scene based on the preceding VOB seamless connection flag VOB_Fsb 
state, which is set to 1 to indicate a seamless connection. 

[0571] At step #2348 the interleaved allocation flag IAF (Fig. 16) is set to 1 in the cell playback information blocks 
C_PBI #i containing the VOB control information for each scene based on the multi-angle seamless switching flag 
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VOB_FsV state, which is set to 1 to indicate interleaving is required. 

[0572] At step #2350 the location of the navigation pack NV (relative sector number from the VOB beginning) is 
detected from the title editing unit (VOB below) obtained from the system encoder 900 in Fig. 25, the navigation pack 
NV is detected based on the minimum interleaved unit presentation time ILVU_MT information (a formatter parameter 

5 obtained in step #1816, Fig. 51 ), the location of the VOBU expressed as the number of sectors from the VOB beginning, 
for example, is thus obtained, and the title editing unit VOB is divided into interleave units using VOBU units. 
[0573] For example, if in this example the minimum interleaved unit presentation time ILVU_MT is 2 sec and the 
presentation time of one VOBU is 0.5 sec, then the VOB is divided into interleave units of 4 VOBU each. Note that 
this allocation operation is applied to the VOB constituting each multi-scene data unit. 

10 [0574] At step #2352 the interleave units of each VOB obtained from step #2350 are arranged in the cell block mode 
CBM sequence (cell block beginning, middle, and end cells) written as the VOB control information for each scene in 
step #2340 to form the interleaved blocks as shown in Fig. 37 or 38. The interleaved blocks are then added to the VTS 
title VOBS (VTSTT_VOBS). Using the cell block mode CBM declarations above, for example, the angle data MA1 , 
MA2, and MA3 (Fig. 23) are arranged in that sequence. 

is [0575] At step #2354 the relative sector number from the VOBU start is written to the VOB end pack address 
VOBU_EA (Fig. 20) in the navigation pack NV of each VOBU based on the VOBU position information obtained in step 
#2350. 

[0576] At step #2356 the first cell VOBU start address C_FVOBU_SA and the last cell VOBU start address 
C_LVOBU_SA expressed as the number of sectors from the beginning of the VTS title VOBS (VTSTT_VOBS) are 
20 written as the addresses of the navigation packs NV of the first and last VOBU in each cell based on the VTS title 
VOBS (VTSTT_VOBS) data obtained in step #2352. 

[0577] The angle #i VOBU start address N SM L_ AG L_C 1 __DSTA - N S M L_ AG L__C 9_D STA of the non-seamless angle 
information NSML_AGLI (Fig. 20) in the navigation pack NV of each VOBU is written at step #2358. This address is 
expressed as the relative sector number inside the data of the interleaved blocks formed in step #2352, and declares 
25 the address information (Fig. 50) of the navigation pack NV contained in the VOBU of all angle scenes near the pres- 
entation start time of the VOBU being processed. 

[0578] At step #2360 VFFFFFFFh" is written to the angle #i VOBU start address NSML_AGL_C1_DSTA - 
NSML_AGL_C9_DSTA of the non-seamless angle information NSML_AGLI (Fig. 20) in the navigation pack NV of each 
VOBU if the VOBU being processed is the last VOBU of each scene in the multi-scene period. 

30 [0579] This routine thus formats the interleaved blocks for multi-angle non-seamless switching control in the multi- 
scene period, and formats the cell control information as the reproduction control information for those multiple scenes. 
[0580] Referring to Fig. 58, the multi-angle seamless switching control routine executed in step #2324 when step 
#2322, Fig. 56, returns YES is described. This routine defines the interleaved arrangement of the multimedia bitstream 
MBS, the content of the cell playback information (C_PBI#i) shown in Fig. 16, and the information stored to the navi- 

35 gation pack NV shown in Fig. 20, in the generated DVD multimedia bitstream MBS. 

[0581] At step #2370 based on the multi-angle flag VOB_Fm setting (= 1) declaring whether multi-angle control is 
applied in the multi-scene period, the cell block mode CBM (Fig. 16) of the cell playback information blocks C_PBI #i 
containing the VOB control information for each scene is declared according to the position of the angle data. For 
example, the cell block mode CBM of the MA1 ceil (Fig. 23) is declared as 01b to indicate the beginning of the cell 

40 block, the CBM of MA2 is declared as 10b to indicate a cell between the first and last cells in the block, and the CBM 
of MA3 is declared as 11b to indicate the end of the cell block. 

[0582] At step #2372 based on the multi-angle flag VOB_Fm setting (= 1) declaring whether multi-angle control is 
applied in the multi-scene period, the cell block type CBT (Fig. 16) of the cell playback information blocks C_PBI #i 
containing the VOB control information for each scene is declared as 01b to indicate an "angle." 
45 [0583] At step #2374 the seamless playback flag SPF (Fig. 16) is set to 1 in the cell playback information blocks 
C_PBI #i containing the VOB control information for each scene based on the preceding VOB seamless connection 
flag VOB_Fsb state, which is set to 1 to indicate a seamless connection. 

[0584] At step #2376 the STC resetting flag STCDF is set to 1 in the cell playback information blocks C_PBI #i 
containing the VOB control information for each scene based on the preceding VOB seamless connection flag VOB_Fsb 
50 state, which is set to 1 to indicate a seamless connection. 

[0585] At step #2378 the interleaved allocation flag IAF (Fig. 16) is set to 1 in the cell playback information blocks 
C_PBI #i containing the VOB control information for each scene based on the multi-angle seamless switching flag 
VOB_FsV state, which is set to 1 to indicate interleaving is required. 

[0586] At step #2380 the location of the navigation pack NV (relative sector number from the VOB beginning) is 
55 detected from the title editing unit (VOB below) obtained from the system encoder 900 in Fig. 25, the navigation pack 
NV is detected based on the minimum interleaved unit presentation time ILVU_MT information (a formatter parameter 
obtained in step #1 854, Fig. 53), the location of the VOBU expressed as the number of sectors from the VOB beginning, 
for example, is thus obtained, and the title editing unit VOB is divided into interleave units using VOBU units. 
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[0587] For example, if in this example the minimum interleaved unit presentation time ILVU_MT is 2 sec and the 
presentation time of one VOBU is 0.5 sec, then the VOB is divided into interleave units of 4 VOBU each. Note that 
this allocation operation is applied to the VOB constituting each muiti-scene data unit. 

[0588] At step #2382 the interleave units of each VOB obtained from step #2380 are arranged in the cell block mode 
5 CBM sequence (cell block beginning, middle, and end cells) written as the VOB control information for each scene in 
step #2360 to form the interleaved blocks as shown in Fig. 37 or 38. The interleaved blocks are then added to the VTS 
title VOBS (VTSTT_VOBS). Using the cell block mode CBM declarations above, for example, the angle data MA1 , 
MA2, and MA3 (Fig. 23) are arranged in that sequence. 

[0589] At step #2384 the relative sector number from the VOBU start is written to the VOB end pack address VOBU 
10 EA (Fig. 20) in the navigation pack NV of each, VOBU based on the VOBU position information obtained in step #2360. 
[0590] At step #2386 the first cell VOBU start address C_FVOBU_SA and the last cell VOBU start address 
C_LVOBU_SA expressed as the number of sectors from the beginning of the VTS title VOBS (VTSTT_VOBS) are 
written as the addresses of the navigation packs NV of the first and last VOBU in each cell based on the VTS title 
VOBS (VTSTT_VOBS) data obtained in step #2382. 
is [0591] At step #2388 the relative sector number from the VOBU start is written to the VOB end pack address 
VOBU_EA (Fig. 20) in the navigation pack NV of each VOBU based on the interleave unit data obtained in step #2370. 
[0592] The angle #i VOBU start address S M L_AG L_C 1 _D STA-S M L_ AG L_C9_D STA of the seamless angle infor- 
mation SML_AGLI (Fig. 20) in the navigation pack NV of each VOBU is written at step #2390. This address is expressed 
as the relative sector number inside the data of the interleaved blocks formed in step #2382, and declares the address 
20 information (Fig. 50) of the navigation pack N V contained in the VOBU of all angle scenes with a start time contiguous 
to the reproduction end time of the VOBU being processed. 

[0593] At step #2392 "7FFFFFFFh" is written to the angle #i VOBU start address SML_AGL_C1_DSTA - 
SML_AGL_C9_DSTA of the seamless angle information SML_AGLI (Fig. 20) in the navigation pack NV of the VOBU 
contained in the interleaved unit if the interleave unit arranged in step #2382 is the last interleave unit of each scene 

25 in the multi-scene period. 

[0594] This routine thus formats the interleaved blocks for multi-angle seamless switching control in the multi-scene 
period, and formats the cell control information as the reproduction control information for those multiple scenes. 
[0595] The parental lock subroutine (step #2320, Fig. 56) executed when step #2318 in Fig. 56 returns NO, i.e., 
when it is determined that parental lock control is implemented and not multi-angle control, is described next with 

30 reference to Fig. 59. 

[0596] The parental lock subroutine described below writes the interleave unit arrangement of the multimedia bit- 
stream, the content of the PGC information entries C_PBI #i (cell playback information) shown in Fig. 16, and the 
navigation pack NV information shown in Fig. 20, to the generated DVD multimedia bitstream. 
[0597] At step #2402 a value "00b" is written to the cell block mode CBM (Fig. 1 6) of the cell playback information 
35 blocks C_PBI #i containing the VOB control information for each scene based on the multi-angle flag VOB_Fm state, 
which is set to 0 to indicate that multi-angle control is not enabled in the multi-scene period. 

[0598] At step #2404 the seamless playback flag SPF (Fig. 16) is set to 1 in the cell playback information blocks 
C_PBI #i containing the VOB control information for each scene based on the preceding VOB seamless connection 
flag VOB_Fsb state, which is set to 1 to indicate a seamless connection. 
40 [0599] At step #2406 the STC resetting flag STCDF is set to 1 in the cell playback information blocks C_PBI #i 
containing the VOB control information for each scene based on the preceding VOB seamless connection flag VOB_Fsb 
state, which is set to 1 to indicate a seamless connection. 

[0600] At step #2408 the interleaved allocation flag IAF (Fig. 16) is set to 1 in the cell playback information blocks 
C_PBi #i containing the VOB control information for each scene based on the multi-angle seamless switching flag 

45 VOB_FsV state, which is set to 1 to indicate interleaving is required. 

[0601] At step #241 0 the navigation pack NV position information (the relative sector number from the VOB start) is 
detected from the title editing unit (VOB) obtained from the system encoder 900 (Fig. 25). The navigation pack NV is 
then detected based on the number of interleaved VOB divisions ILV_DIV, a formatter parameter obtained in step 
#1874 in Fig. 55, to obtain the VOBU position information (number of sectors from the VOB start), and divide each 

50 VOB into the specified number of interleave units in VOBU units. 

[0602] At step #2412 the interleave units obtained in step #2410 are then interleaved. For example, the interleave 
units are arranged in ascending VOB number sequence to create the interleaved blocks as shown in Fig. 37 or 38, 
and the interleaved blocks are added to the VTS title VOBS (VTSTT_VOBS). 

[0603] At step #2414 the relative sector number from the VOBU start is written to the VOB end pack address 
55 VOBU_EA (Fig. 20) in the navigation pack NV of each VOBU based on the VOBU position information obtained in step 
#2186. 

[0604] At step #2416 the first cell VOBU start address C_FVOBU__SA and the last cell VOBU start address 
C_LVOBU_SA expressed as the number of sectors from the beginning of the VTS title VOBS (VTSTT_VOBS) are 
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written as the addresses of the navigation packs NV of the first and last VOBU in each cell based on the VTS title 
VOBS (VTSTT_VOBS) data obtained in step #2412. 

[0605] At step #241 8 the relative sector number to the last interleave unit pack is written to the ILVU end pack address 
ILVUJEA in the navigation pack NV of the VOBU forming the interleaved units based on the interleaved unit data 
5 obtained from step #2412. 

[0606] At step #2420, the relative sector number in the interleaved block data formed in step #2412 is written to the 
nexHLVU start address NT_ILVU_SA as the position information of the next ILVU in the navigation packs NV of the 
VOBU contained in the interleaved unit ILVU. 

[0607] At step #2422 the interleaved unit flag ILVU flag is set to 1 in the navigation packs NV of the VOBU contained 

19 in the interleaved unit ILVU. 

[0608] At step #2424, the Unit END flag of the navigation pack NV in the last VOBU of the interleaved unit ILVU is 
set to 1 . 

[0609] At step #2426 "FFFFFFFFh" is written to the nexHLVU start address NT_ILVU_SA of the navigation pack NV 
of the VOBU in the last interleaved unit ILVU of each VOB. 
15 [061 0] The operation described above thus formats the interleaved blocks to enable parental lock control in the multi- 
scene periods, and formats the control information in the cells, i.e., the cell playback control information for the multi- 
scene periods. 

[0611] The single scene subroutine executed as step #2314 in Fig. 56 when steps #2312 or #2316 return NO, i.e. , 
when the scene is determined to be a single scene and not a multi-scene period, is described next using Fig. 60. 

20 [0612] The single scene subroutine described below writes the interleave unit arrangement of the multimedia bit- 
stream, the content of the PGC information entries C_PBI #i (cell playback information) shown in Fig. 16, and the 
navigation pack NV information shown in Fig. 20, to the generated DVD multimedia bitstream. 

[0613] At step #2430 a value "OOb" indicating a "non-cell block", i.e., that there is only one cell in the functional block, 
is written to the cell block mode CBM (Fig. 16) of the cell playback information blocks C_PBI #i containing the VOB 
25 control information for each scene based on the multi-scene flag VOB_Fp state, which is set to 0 to indicate that the 
scene is a single scene and not part of a multi-scene period. 

[0614] At step #2432 the interleaved allocation flag IAF (Fig. 16) is set to 0 in the cell playback information blocks 
C_PBI #j containing the VOB control information for each scene based on the multi-angle seamless switching flag 
VOB_FsV state, which is set to 0 to indicate interleaving is not required. 

30 [0615] At step #2434 the navigation pack NV position information (the relative sector number from the VOB start) is 
detected from the title editing unit (VOB) obtained from the system encoder 900 (Fig. 25), placed in the VOBU unit, 
and added to the VTS title VOBS (VTSTT_VOBS), the video and other stream data of the multimedia bitstream. 
[0616] At step #2436 the relative sector number from the VOBU start is written to the VOB end pack address 
VOBU_EA (Fig. 20) in the navigation pack NV of each VOBU based on the VOBU position information obtained in step 

35 #2434. 

[0617] At step #2438 the first cell VOBU start address C_FVOBU_SA and the last cell VOBU start address 
C_LVOBU_SA expressed as the number of sectors from the beginning of and the end of, respectively, the VTS title 
VOBS (VTSTT_VOBS) of the value written as the addresses of the navigation packs NV of the first and last VOBU in 
cell based on the VTS title VOBS (VTSTT_VOBS) data obtained in step #2434. 
40 [0618] At step #2440 the state determined as a result of step #300 or #600 in Fig. 51 , i.e., whether preceding VOB 
seamless connection flag VOB_Fsb is set to 1 indicating a seamless connection to the preceding or following scenes, 
is evaluated. If step #2440 returns YES, the procedure moves to step #2442. 

[0619] At step #2442 the seamless playback flag SPF (Fig. 16) is set to 1 in the cell playback information blocks 
C_PBI #i containing the VOB control information for each scene based on the preceding VOB seamless connection 
45 flag VOB_Fsb state, which is set to 1 to indicate a seamless connection. 

[0620] At step #2444 the STC resetting flag STCDF is set to 1 in the cell playback information blocks C_PBI #i 
containing the VOB control information for each scene based on the preceding VOB seamless connection flag VOB_Fsb 
state, which is set to 1. 

[0621] If step #2440 returns NO, i.e., there is not a seamless connection to the preceding scene, the procedure 
so moves to step #2446. 

[0622] At step #2446 the seamless playback flag SPF (Fig. 16) is set to 0 in the cell playback information blocks 
C_PBI #i containing the VOB control information for each scene based on the preceding VOB seamless connection 
flag VOB_Fsb state, which is set to 0 to indicate a non-seamless connection. 

[0623] At step #2448 the STC resetting flag STCDF is set to 0 in the cell playback information blocks C_PBI #i 
55 containing the VOB control information for each scene based on the preceding VOB seamless connection flag VOB_Fsb 
state, which is set to 0. 

[0624] The operation described above thus formats a multimedia bitstream for a single scene period, and records 
the control information in the cells, i.e., the cell playback control information (C_PBI #i, Fig. 16), and the information 
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in the navigation pack NV (Fig. 20), to the produced DVD multimedia bitstream. 

Decoder flow charts 

5 Disk-to-stream buffer transfer flow 

[0625] The decoding information table produced by the decoding system controller 2300 based on the scenario 
selection data St51 is described below referring to Figs. 62 and 63. The decoding information table comprises the 
decoding system table shown in Fig. 62, and the decoding table shown in Fig. 63. 
.10 . , [0626] As shown in Fig. 62 , the decoding system table comprises a scenario information register and a ceil information 
register. The scenario information register records the title number and other scenario reproduction information selected 
by the user and extracted from the scenario selection data St51 . The cell information register extracts and records the 
information required to reproduce the cells constituting the program chain PGC based on the user-defined scenario 
information extracted into the scenario information register. 
15 [0627] More specifically, the scenario information register contains plural sub-registers, i.e., the angle number 
ANGLE_NO_reg, VTS number VTS_NO_reg, PGC number VTS_PGCI_NO_reg, audio ID AUDIO_ID_reg, sub-picture 
ID SPJD_reg, and the system clock reference SCR buffer SCR_buffer. 

[0628] The angle number ANGLE_NO_reg stores which angle is reproduced when there are multiple angles in the 
reproduction program chain PGC. 
20 [0629] The VTS number VTS_NO_reg records the number of the next VTS reproduced from among the plural VTS 
on the disk. 

[0630] The PGC number VTS_PGCI_NO_reg records which of the plural program chains PGC present in the video 
title set VTS is to be reproduced for parental lock control or other applications. 

[0631] The audio ID AUDIOJD_reg records which of the plural audio streams in the VTS are to be reproduced. 
25 [0632] The sub-picture ID SP_ID_reg records which of the plural sub-picture streams is to be reproduced when there 
are plural sub-picture streams in the VTS. 

[0633] The system clock reference SCR buffer SCR_buffer is the buffer for temporarily storing the system clock 
reference SCR recorded to the pack header as shown in Fig. 19. As described using Fig. 26, this temporarily stored 
system clock reference SCR is output to the decoding system controller 2300 as the bitstream control data St63. 
30 [0634] The cell information register contains the following sub-registers: the cell block mode CBIVLreg, cell block 
type CBT_reg, seamless reproduction flag SPF_reg, interleaved allocation flag IAF_reg, STC resetting flag STCDF, 
seamless angle change flag SACF__reg, first cell VOBU start address C_FVOBU_SA_reg, and last cell VOBU start 
address CJ_VOBU_SA_reg. 

[0635] The cell block mode CBM_reg stores a value indicating whether plural cells constitute one functional block. 
35 |f there are not plural cells in one functional block, CBM_reg stores N_BLOCK. If plural cells constitute one functional 
block, the value F_CELL is stored as the CBM_jeg value of the first cell in the block, L_CELL is stored as the CBM_reg 
value of the last cell in the block, and BLOCK is stored as the CBM_reg of value all cells between the first and last 
cells in the block. 

[0636] The cell block type CBT_reg stores a value defining the type of the block indicated by the cell block mode 
40 CBM_reg. If the cell block is a multi-angle block, A_BLOCK is stored; if not, N_BLOCK is stored. 

[0637] The seamless reproduction flag SPF_reg stores a value defining whether that cell is seamless connected 
with the cell or cell block reproduced therebefore. If a seamless connection is specified, SML is stored; if a seamless 
connection is not specified, NSML is stored. 

[0638] The interleaved allocation flag IAF_reg stores a value identifying whether the cell exists in a contiguous or 
45 interleaved block. If the cell is part of a an interleaved block, ILVB is stored; otherwise N JLVB is stored. 

[0639] The STC resetting flag STCDF defines whether the system time clock STC used for synchronization must be 
reset when the cell is reproduced; when resetting the system time clock STC is necessary, STC_RESET is stored; if 
resetting is not necessary, STC_N RESET is stored. 

[0640] The seamless angle change flag SACF_reg stores a value indicating whether a cell in a multi-angle period 
50 should be connected seamlessly at an angle change. If the angle change is seamless, the seamless angle change 
flag SACF is set to SML; otherwise it is set to NSML. 

[0641] The first cell VOBU start address C_FVOBU_SA_reg stores the VOBU start address of the first cell in a block. 
The value of this address is expressed as the distance from the logic sector of the first cell in the VTS title VOBS 
(VTSTT_VOBS) as measured by and expressed (stored) as the number of sectors. 
55 [0642] The last cell VOBU start address C_LVOBU_SA_reg stores the VOBU start address of the last cell in the 
block. The value of this address is also expressed as the distance from the logic sector of the first cell in the VTS title 
VOBS (VTSTT_VOBS) measured by and expressed (stored) as the number of sectors. 

[0643] The decoding table shown in Fig. 63 is described below. As shown in Fig. 63, the decoding table comprises 
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the following registers: information registers for non-seamless multi-angle control, information registers for seamless 
multi-angle control, a VOBU information register, and information registers for seamless reproduction. 
[0644] The information registers for non-seamless multi-angle control comprise sub-registers 
NSML_AGL_C1_DSTA_reg - NSML AGL_C9_DSTA_reg. 
5 [0645] N S M L_AG L_C 1 _DSTA_reg - NSML_AGL_C9_DSTA_reg record the N M S L_ AG L_C 1 _D STA - NMSL AGL 
C9_DSTA values in the PCI packet shown in Fig. 20. 

[0646] The information registers for seamless multi-angle control comprise sub-registers SML_AGL_C1_DSTA_reg 
- SML_AGL_C9_DSTA_reg. 

[0647] S M L_AG L_C 1 _DSTA_reg - SML_AGL_C9_DSTA_reg record the SM L_AG L_C 1 _DSTA - SML_AGL C9 

10 DSTA values in the DSI packet shown in Fig. 20. 

[0648] The VOBU information register stores the end pack address VOBU_EA in the DSI packet shown in Fig. 20. 
[0649] The information registers for seamless reproduction comprise the following sub-registers: an interleaved unit 
flag ILVU Jlag_reg, Unit END flag UNIT_END_flag_reg, Interleaved Unit End Address ILVU_EA_reg, Next Interleaved 
Unit Start Address NTJLVU_SA_reg, the presentation start time of the first video frame in the VOB (Initial Video Frame 

15 Presentation Start Time) VOB_V_SPTM_reg, the presentation end time of the last video frame in the VOB (Final Video 
Frame Presentation Termination Time) VOB_V_EPTM_reg, audio reproduction stopping time 1 
VOB_A_STP_PTM1_reg, audio reproduction stopping time 2 VOB_A_STP_PTM2_reg, audio reproduction stopping 
period 1 VOB_A_GAP_LEN1 _reg, and audio reproduction stopping period 2 VOB_A„GAP_LEN2_reg. 
[0650] The interleaved unit flag ILVU_flag_reg stores the value indicating whether the video object unit VOBU is in 

20 an interleaved block, and stores ILVU if it is, and NJLVU if not. 

[0651] The Unit END flag UNIT_END_f!ag_reg stores the value indicating whether the video object unit VOBU is the 
last VOBU in the interleaved unit ILVU. Because the interleaved unit ILVU is the data unit for continuous reading, the 
UNIT_END_flag_reg stores END if the VOBU currently being read is the last VOBU in the interleaved unit ILVU, and 
otherwise stores N_END. 

25 [0652] The Interleaved Unit End Address ILVU_EA_reg stores the address of the last pack in the ILVU to which the 
VOBU belongs if the VOBU is in an interleaved block. This address is expressed as the number of sectors from the 
navigation pack NV of that VOBU. 

[0653] The Next Interleaved Unit Start Address NT_ILVU_SA_reg stores the start address of the next interleaved 
unit ILVU if the VOBU is in an interleaved block. This address is also expressed as the number of sectors from the 
30 navigation pack NV of that VOBU. 

[0654] The Initial Video Frame Presentation Start Time register VOB_V_SPTM„reg stores the time at which pres- 
entation of the first video frame in the VOB starts. 

[0655] The Final Video Frame Presentation Termination Time register VOB_V_EPTM_reg stores the time at which 
presentation of the last video frame in the VOB ends. 
35 [0656] The audio reproduction stopping time 1 VOB_A_STP_PTM1_reg stores the time at which the audio is to be 
paused to enable resynchronization, and the audio reproduction stopping period 1 VOB_AJ3AP_LEN1_reg stores the 
length of this pause period. 

[0657] The audio reproduction stopping time 2 VOB_A_STP_PTM2_reg and audio reproduction stopping period 2 
VOB_A GAP_LEN2_reg store the same values. 
40 [0658] The operation of the DVD decoder DCD according to the present invention as shown in Fig. 26 is described 
next below with reference to the flow chart in Fig. 69. 

[0659] At step #31 0202 it is first determined whether a disk has been inserted. If it has, the procedure moves to step 
#310204. 

[0660] At step #310204, the volume file structure VFS (Fig. 21) is read, and the procedure moves to step #310206. 
45 [0661] At step #310206, the video manager VMG (Fig. 21) is read and the video title set VTS to be reproduced is 
extracted. The procedure then moves to step #310208. 

[0662] At step #310208, the video title set menu address information VTSM_C_ADT is extracted from the VTS in- 
formation VTSI, and the procedure moves to step #310210. 

[0663] At step #31 021 0 the video title set menu VTSM_VOBS is read from the disk based on the video title set menu 

so address information VTSM_C_ADT, and the title selection menu is presented. 

[0664] The user is thus able to select the desired title from this menu in step #310212. If the titles include both 
contiguous titles with no user-selectable content, and titles containing audio numbers, sub-picture numbers, or multi- 
angle scene content, the user must also enter the desired angle number. Once the user selection is completed, the 
procedure moves to step #310214. 

55 [0665] At step #31 021 4, the VTS_PGCI #i program chain (PGC) data block corresponding to the title number selected 
by the user is extracted from the VTSPGC information table VTS_PGCIT, and the procedure moves to step #31021 6. 
[0666] Reproduction of the program chain PGC then begins at step #31 021 6. When program chain PGC reproduction 
is finished, the decoding process ends. If a separate title is thereafter to be reproduced as determined by monitoring 
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key entry to the scenario selector, the title menu is presented again (step #310210). 

[0667] Program chain reproduction in step #310216 above is described in further detail below referring to Fig. 64. 
The program chain PGC reproduction routine consists of steps #31 030, #31032, #31034, and #31035 as shown. 
[0668] At step #31 030 the decoding system table shown in Fig. 62 is defined. The angle number ANGLE_NO_reg, 
5 VTS number VTS_NO_/eg, PGC number VTS_PGCI_NO_reg, audio ID AUDIO_ID_reg, and sub-picture ID SP_ID_reg 
are set according to the selections made by the user using the scenario selector 2100. 

[0669] Once the PGC to be reproduced is determined, the corresponding cell information (PGC information entries 
C_PBI #j) is extracted and the cell information register is defined. The sub-registers therein that are defined are the 
cell block mode CBM_reg, cell block type CBT_reg, seamless reproduction flag SPF_reg, interleaved allocation flag 
10 IAF_reg, STC resetting flag STCDF, seamless angle change flag SACF_reg, first cell VOBU start address 
C_FVOBU_SA_reg, and last cell VOBU start address C_LVOBU_SA_reg. 

[0670] Once the decoding system table is defined, the process transferring data to the stream buffer (step #31 032) 
and the process decoding the data in the stream buffer (step #31034) are activated in parallel. 
[0671 ] The process transferring data to the stream buffer (step #31 032) is the process of transferring data from the 
15 recording medium M to the stream buffer 2400. This is, therefore, the processing of reading the required data from the 
recording medium M and inputting the data to the stream buffer 2400 according to the user-selected title information 
and the playback control information (navigation packs NV) written in the stream. 

[0672] The routine shown as step #31 034 is the process for decoding the data stored to the stream buffer 2400 (Fig. 
26), and outputting the decoded data to the video data output terminal 3600 and audio data output terminal 3700. Thus, 
20 is the process for decoding and reproducing the data stored to the stream buffer 2400. 
[0673] Note that step #31 032 and step #31 034 are executed in parallel. 

[0674] The processing unit of step #31 032 is the cell, and as processing one cell is completed, it is determined in 
step #31 035 whether the complete program chain PGC has been processed. If processing the complete program chain 
PGC is not completed, the decoding system table is defined for the next cell in step #31 030. This loop from step #31 030 
25 through step #31035 is repeated until the entire program chain PGC is processed. 

[0675] The stream buffer data transfer process of step #31032 is described in further detail below referring to Fig. 
70. The stream buffer data transfer process (step #31032) comprises steps #31040, #31042, #31044, #31046, and 
#31 048 shown in the figure. 

[0676] At step #31040 it is determined whether the cell is a multi-angle cell. If not, the procedure moves to step 
30 #30144. 

[0677] At step #31 044 the non-multi-angle cell decoding process is executed. 

[0678] However, if step #30140 returns YES because the cell is a multi-angle cell, the procedure moves to step 
#30142 where the seamless angle change flag SACF is evaluated to determine whether seamless angle reproduction 
is specified. 

35 [0679] If seamless angle reproduction is specified, the seamless multi-angle decoding process is executed in step 
#30146. If seamless angle reproduction is not specified, the non-seamless multi-angle decoding process is executed 
in step #30148. 

[0680] The non-multi-angle cell decoding process (step #31044, Fig. 70) is described further below with reference 
to Fig. 71. Note that the non-multi-angle cell decoding process (step #31044) comprises the steps #31050, #31052, 
40 and #31054. 

[0681] The first step #31 050 evaluates the interleaved allocation flag IAF_reg to determine whether the cell is in an 
interleaved block. If it is, the non-multi-angle interleaved block process is executed in step #31052. 
[0682] The non-multi-angle interleaved block process (step #31052) processes scene branching and connection 
where seamless connections are specified in, for example, a multi-scene period. 
45 [0683] However, if the cell is not in an interleaved block, the non-multi-angle contiguous block process is executed 
in step #31054. Note that the step #31054 process is the process executed when there is no scene branching or 
connection. 

[0684] The non-multi-angle interleaved block process (step #31 052, Fig. 71) is described further below with reference 
to Fig. 72. 

so [0685] At step #31060 the reading head 2006 is jumped to the first cell VOBU start address C_FVOBU_SA read 
from the C_FVOBU_SA_reg register. 

[0686] More specifically, the address data C_FVOBU_SA_reg stored in the decoding system controller 2300 (Fig. 
26) is input as bitstream reproduction control signal St53 to the reproduction controller 2002. The reproduction controller 
2002 thus controls the recording media drive unit 2004 and signal processor 2008 to move the reading head 2006 to 
55 the specified address, data is read, error correction code ECC and other signal processing is accomplished by the 
signal processor 2008, and the cell start VOBU data is output as the reproduced bitstream St61 to the stream buffer 
2400. The procedure then moves to step #31062. 

[0687] At step #31 062 the DSI packet data in the navigation pack NV (Fig. 20) is extracted in the stream buffer 2400, 
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the decoding table is defined, and the procedure moves to step #31064. The registers set in the decoding table are 
the ILVU_EA_reg, NT_ILVlLSA_reg, VOB_V_SPTM_reg, VOB_V_EPTM_reg, VOB_A_STP_PTM1_reg, 
VOB_A_STP_PTM2_reg, VOB_AJ3APJ_EN1_reg, and VOB_A_GAP_LEN2_reg. 

[0688] At step #31064 the data from the first cell VOBU start address C_FVOBU_SA_reg to the ILVU end pack 
5 address ILVU_EA_reg, i.e., the data for one interleaved unit ILVU, is transferred to the stream buffer 2400. The pro- 
cedure then moves to step #31 066. 

[0689] More specifically, the address data ILVU_EA_reg stored in the decoding system controller 2300 (Fig. 26) is 
supplied to the reproduction controller 2002. The reproduction controller 2002 thus controls the recording media drive 
unit 2004 and signal processor 2008 to read the data to the ILVU_EA_reg address, and after error correction code 
10 ECC and other.signal processing is accomplished by the signal processor 2008, the data for the first ILVU in the cell 
is output as the reproduced bitstream St61 to the stream buffer 2400. It is thus possible to output the data for one 
contiguous interleaved unit ILVU on the recording medium M to the stream buffer 2400. 

[0690] At step #31066 it is determined whether all interleaved units in the interleaved block have been read and 
transferred. If the interleaved unit ILVU processed is the last ILVU in the interleaved block, "0x7FFFFFFF" indicating 
15 termination is set to the next-ILVU start address NT_ILVU_SA_reg as the next read address. If all interleaved units in 
the interleaved block have thus been processed, the procedure moves to step #31 068. 

[0691 ] At step #31 068 the reading head 2006 is again jumped to the address NT_ILVU_S A_reg of the next interleave 
unit to be reproduced, and the procedure loops back to step #31062. Note that this jump is also accomplished as 
described above, and the loop from step #31062 to step #31068 is repeated. 
20 [0692] However, if step #31066 returns YES, i.e., all interleaved unit ILVU in the interleaved block have been trans- 
ferred, step #31052 terminates. 

[0693] The non-multi-angle interleaved block process (step #31 052) thus transfers the data of one cell to the stream 
buffer 2400. 

[0694] The non-multi-angle contiguous block process is executed in step #31 054, Fig. 71 , is described further below 
25 with reference to Fig. 73. 

[0695] At step #31070 the reading head 2006 is jumped to the first cell VOBU start address C_FVOBU_SA read 
from the C_FVOBU_SA_reg register. This jump is also accomplished as described above, and the loop from step 
#31072 to step #31076 is initiated. 

[0696] At step #31 072 the DSI packet data in the navigation pack N V (Fig. 20) is extracted in the stream buffer 2400, 
30 the decoding table is defined, and the procedure moves to step #31074. The registers set in the decoding table are 
the VOBU_EA_reg, VOB_V_SPTM_reg, VOB_V_EPTM_reg, VOB_A_STP_PTM1_reg, VOB_A_STP_PTM2_reg, 
VO B_A_G AP_L E N 1 _reg , and VO B_ A_G A P_L E N 2_reg . 

[0697] At step #31 074 the data from the first cell VOBU start address C_FVOBU_SA_reg to the end pack address 
VOBU_EA_reg, i.e., the data for one video object unit VOBU, is transferred to the stream buffer 2400. The procedure 
35 then moves to step #31076. The data for one video object unit VOBU contiguously arrayed to the recording medium 
M can thus be transferred to the stream buffer 2400. 

[0698] At step #31 076 it is determined whether all cell data has been transferred. If all VOBU in the cell has not been 
transferred, the data for the next VOBU is read continuously, and the process loops back to step #31070. 
[0699] However, if all VOBU data in the cell has been transferred as determined by the CJ_VOBU_SA_reg value in 
40 step #31076, the non-multi-angle contiguous block process (step #31054) terminates. This process thus transfers the 
data of one cell to the stream buffer 2400. 

[0700] Another method of accomplishing the non-multi-angle cell decoding process (step #31044, Fig. 70) is de- 
scribed below with reference to Fig. 74. 

[0701 ] At step #31 080 the reading head 2006 is jumped to the first cell VOBU start address C_FVOBU_SA_reg, and 
45 the first VOBU data in the cell is transferred to the stream buffer 2400. The procedure then moves to step #31 081 . 
[0702] At step #31 081 the DSI packet data in the navigation pack NV (Fig. 20) is extracted in the stream buffer 2400, 
the decoding table is defined, and the procedure moves to step #31082. The registers set in the decoding table are 
the SCR_buffer, VOBU_EA_reg, ILVUJIag_reg, UNIT_END_flag_reg, ILVU_EA_reg, NT_ILVU_SA_reg, 
VOB_V_S PTM_reg, VOB_V_EPTM_reg, VOB_A_STP_PTM 1 _reg, VOB_A_STP_PTM2__reg, 

so VOB_Aj3AP_LEN1_reg, and VO B_A_G A P_LEN2_reg . 

[0703] At step #31082 the data from the first cell VOBU start address C_FVOBU_SA_reg to the end pack address 
VOBU_EA_reg, i.e., the data for one video object unit VOBU, is transferred to the stream buffer 2400. The procedure 
then moves to step #31083. 

[0704] At step #31083 is determined whether all cell VOBU data has been transferred. If it has, the process (step 
55 #31 044) terminates. If it has not, the procedure moves to step #31 084. 

[0705] At step #31 084 it is determined whether the VOBU is the last VOBU in the interleaved unit. If not, the process 
loops back to step #31 081 . If so, the procedure advances to step #31 085. It is thus possible to transfer one cell of data 
in VOBU units to the stream buffer 2400. 
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[0706] The loop from step #31 081 to step #31 084 repeats as described above. 

[0707] At step #31085 it is determined whether the interleaved unit ILVU is the last in the interleaved block. If so, 
step #31 044 terminates. If not, the procedure advances to step #31 086. 

[0708] At step #31 086 the reading head 2006 is jumped to the address NTJLVU_SA_reg of the next interleave unit, 
5 and the procedure loops back to step #31 081 . It is thus possible to transfer the data for one cell to the stream buffer 2400. 
[0709] The seamless multi-angle decoding process executed in step #30146, Fig. 70, is described below referring 
to Fig. 75. 

[0710] At step #31090 the reading head 2006 is jumped to the first cell VOBU start address C_FVOBU_SA read 
from the C_FVOBU_SA_reg register, and the first VOBU data in the cell is transferred to the stream buffer 2400. The 
to . procedure then moves to step #31091 . This jump is also accomplished as described. above, and the loop from step 
#31091 to step #31095 is initiated. 

[071 1 ] At step #31 091 the DSI packet data in the navigation pack N V (Fig. 20) is extracted in the stream buffer 2400, 
the decoding table is defined, and the procedure moves to step #31092. The registers set in the decoding table are 
the ILVU__EA_reg, SML_AGL_C1_DSTA_reg-SML_AGL_C9_DSTA_reg, VOB_V_SPTM_reg, VOB_V_EPTM_reg, 
15 VOB_A_STP_PTM1_reg, VOB_A_STP_PTM2_reg, VO B_ A_G A P_L E N 1 _ reg , and VOB_A_GAP_LEN2_reg. 

[0712] At step #31092 the data from the first cell VOBU start address C_FVOBU_SA_reg to the ILVU end pack 
address ILVU_EA_reg, i.e., the data for one ILVU, is transferred to the stream buffer 2400. The procedure then moves 
to step #31 093. It is thus possible to output the data for one contiguous interleaved unit ILVU on the recording medium 
M to the stream buffer 2400. 

20 [0713] At step #31 093 the ANGLEJ\IO_reg is updated, and the procedure moves to step #31 094. This update op- 
eration resets the ANGLE_NO_reg to the angle number of the angle selected by the user when the user changes the 
angle using the scenario selector 21 00 (Fig. 26). 

[071 4] At step #31 094 it is determined whether the angle cell data has all been transferred. If all ILVU in the cell have 
not been transferred, the procedure moves to step #31095. If all ILVU in the cell have been transferred, the process 
25 terminates. 

[0715] At step #31095 the reading head 2006 is jumped to the next angle (SML_AGL_C#n_reg), and the process 
loops back to step #31091 . Note that SML_AGL_C#n_reg is the address of the angle to which the ANGLE_NO_reg 
was updated in step #31093. 

[071 6] It is thus possible to transfer the data for the angle selected by the user to the stream buffer 2400 in ILVU units. 
30 [0717] The non-seamless multi-angle decoding process is executed in step #30148, Fig. 70, is described below 
referring to Fig. 65, 

[0718] At step #31100 the reading head 2006 is jumped to the first cell VOBU start address C_FVOBU_SA read 
from the C_FVOBU_SA_reg register, and the first VOBU data in the cell is transferred to the stream buffer 2400. The 
procedure then moves to step #31101 . This jump is also accomplished as described above, and the loop from step 

35 #31101 to step #31106 is initiated. 

[0719] At step #31101 the DSI packet data in the navigation pack NV (Fig. 20) is extracted in the stream buffer 2400, 
the decoding table is defined, and the procedure moves to step #31102. The registers set in the decoding table are 
the VOBU_EA_reg, N S M L_AG L_C 1 _DSTA_reg , NSML_AGL_C9_DSTA_reg, VOB_V_SPTM_reg, 
VOB_V_EPTM_reg, VOB_A_STP_PTM1 _reg, VOB_A_STP_PTM2_reg, VOB_A_GAP_LEN 1 _reg, and 

40 VOB_A_GAP_LEN2_reg. 

[0720] At step #31 1 02 the data from the first cell VOBU start address C_FVOBU_SA_reg to the end pack address 
VOBU_EA_reg, i.e., the data for one VOBU, is transferred to the stream buffer 2400. The procedure then moves to 
step #31 1 03 . It is thus possible to output the data for one contiguous video object unit VOBU on the recording medium 
M to the stream buffer 2400. 

45 [0721 ] At step #31 1 03 the ANGLE_NO_reg is updated, and the procedure moves to step #31 1 04. This update op- 
eration resets the ANGLE_NO_reg to the angle number of the angle selected by the user when the user changes the 
angle using the scenario selector 21 00 (Fig. 26). 

[0722] At step #31104 it is determined whether the angle cell data has all been transferred. If all VOBU in the cell 
have not been transferred, the procedure moves to step #31105. If all VOBU in the cell have been transferred, the 
50 process terminates. 

[0723] At step #31105 the reading head 2006 is jumped to the next angle (NSML_AGL_C#n_reg), and the process 
advances to step #31106. Note that NSM L_AG L__C#n_/eg is the address of the angle to which the ANGLE_NO_reg 
was updated in step #31103. 

[0724] It is thus possible to transfer the data for the angle selected by the user to the stream buffer 2400 in VOBU units. 
55 [0725] Step #311 06 is an effective step for high speed angle switching, and simply clears the stream buffer 2400. 
By thus clearing the stream buffer 2400 the data for the newly selected angle can be reproduced without reproducing 
the angle data that is still not decoded. In other words, clearing the stream buffer 2400 enables faster response to user 
operations. 
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[0726] It is very important that DVD decoder according to the present invention can promptly moves to the next data 
reading process and effectively performs the data reading once after the detection of the end of data such as interleave 
unit ILVU and video object unit VOBU for the sake of seamless reproduction which is one of main targets of the present 
invention. 

5 [0727] With reference to Fig. 66, a construction of the stream buffer 2400 which can performs the end detection of 
interleave unit ILVU is described briefly. The stream buffer 2400 comprises a VOB buffer 2402, a system buffer 2404, 
a navigation pack extractor 2406, and a data counter 2408. The system buffer 2404 temporarily stores the title control 
data VTSI(Fig. 16) included in signal St61 , and outputs a control information St2450 (St63) such as VTS_PGC. 
[0728] The VOB buffer 2402 temporarily stores the title VOB data VTSTT_VOB (Fig. 1 002), and the stream St67 to 

10 the system decoder 2500. 

[0729] The NV (navigation pack) extractor 2406 receives the VOB data at the same time with the VOB buffer 2402, 
and extracts the navigation pack NV therefrom. The NV extractor 2406 furthermore extracts the VOBU final pack 
address COBU_EA or ILVU final pack address ILVU_EA which are the DSI generation information DSI_GI shown in 
Fig. 19 to produce a pack address information St2452 (St63). 

15 [0730] The data counter 2408 receives the VOB data at the same time with the VOB buffer 2402, and counts each 
of pack data shown in Fig. 19 byte by bye. Then, the data counter 2408 produces a pack input terminating signal 
St2454 (St63) at the time when the inputting of pack data is completed. 

[0731] Due to its construction shown in Fig. 66, the stream buffer 2400 performs the VOBU data transfer as examples 
at step #31 064 of Fig. 72, as follows. The stream buffer 2400 outputs the VOBU data for the N V extractor 2406 and 
20 data counter 2408 at the same time when the VOBU buffer 2400 receives the VOBI data on the top of interleave unit 
ILVU. As a result, the NV extractor 2406 can extracts the data of ILVUJEA and NTJLVU_SA at the same time with 
the inputting of navigation pack data NV, and outputs thereof as signal St2452 (St63) to the decode system controller 
2300 (Fig. 26). 

[0732] The decode system controller 2300 stores the signal St2452 into the ILVU_EA_reg and NT_ILVU_SA_reg, 
25 and then start to counts the number of packs based on the pack terminating signal 2452 from the data counter 2408. 
Based on the fore mentioned the counted value of packs and ILVU_EA_reg, the decode system controller 2300 detects 
the instance when the inputting of final pack data of ILVU is completed, or the inputting final byte data of the final pack 
of the ILVU is completed. Then, the controller 2300 further give a command for the bitstream reproducer 2000 to move 
to the position having a sector address indicated by NT_ILVU_SA_reg. The bitstream producer 2000 moves to the 
30 sector address indicated NT_ILVU_SA_reg, and starts to read the data. Thus, the detection of final end of ILVU and 
reading process for the next ILVU can be performed effectively. 

[0733] In the above, an example where the multimedia data MBS is reproduced by the bitstream reproducer 2000 
without a buffering process, and is inputted to the stream buffer 2499. However, in the case that the signal processor 
2008 of the bitstream reproducer 2000 is incorporated with a buffer for error correction process, for example, the 
35 controller 2300 gives a moving command to reproducer 2000 so that the reproducer 2000 moves to the reading position 
indicated by NTJLVU_SA_reg after completion of the final pack data of fore mentioned ILVU and clearing the internal 
buffer of the reproducer 2000. Thus, the effective reproduction of ILVU data even when the bitstream reproducer 2000 
includes a buffer for error correction code (ECC) process. 

[0734] Furthermore, when the bitstream producer 2000 has a buffer for ECC process, the data can be transferred 
40 effectively by providing any means having a function equivalent to that of data counter 2408 (Fig. 66). In other words, 
the bitstream reproducer 2000 generates the pack input completion signal St62; the decode system controller 2300 
gives a command based on the signal St62 the bitstream reproducer 200 to move to the reading position having sector 
address designated by NTJLVU_SA_reg. As apparent from the above, the data can be transferred effectively even 
when the bitstream reproducer 2000 has a function to buffer the data reproduced from the recording media M. 
45 [0735] It is to be noted that the apparatus and method substantially the same as those described in the above with 
respect to the interleave unit ILVU can be used for the detection VOBU end. In other words, by replacing the extraction 
of ILVU__EA and NT_ILVU_Sa, and the storing of ILVU_EA_reg and NT_ILVU_SA_reg with the extraction of VOBU_EA 
and storing VOBU_EA_reg, the apparatus and method according to the present invention, described above, can be 
used for the detection of an_@end. This is effective for the VOBU data transferring operations shown at steps #31 074, 
50 #31082, #31092, and #31102. 

[0736] Thus, the reading of data such as ILVU and VOBU can be performed effectively. 

Decoding flows in the stream buffer 

55 [0737] The process for decoding data in the stream buffer 2400 shown as step #31 034 in Fig. 64 is described below 
referring to Fig. 67. This process (step #31034) comprises steps #31110, #31112, #31114, and #31116. 
[0738] At step #31110 data is transferred in pack units from the stream buffer 2400 to the system decoder 2500 (Fig. 
26). The procedure then moves to step #31112. 
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[0739] At step #31 1 1 2 the pack data is from the stream buffer 2400 to each of the buffers, i.e. , the video buffer 2600, 
sub-picture buffer 2700, and audio buffer 2800. 

[0740] At step #31112 the Ids of the user-selected audio and sub-picture data, i.e., the audio ID AUDIO_ID_reg and 
the sub-picture ID SP_ID_reg stored to the scenario information register shown in Fig. 62, are compared with the 
5 stream ID and sub-stream ID read from the packet header (Fig. 19), and the matching packets are output to the re- 
spective buffers. The procedure then moves to step #31114. 

[0741] The decode timing of the respective decoders (video, sub-picture, and audio decoders) is controlled in step 
#31114, i.e., the decoding operations of the decoders are synchronized, and the procedure moves to step #31116. 
[0742] Note that the decoder synchronization process of step #31114 is described below with reference to Fig. 68. 
10 [0743] The respective elementary strings are then decoded at step #31 1 1 6. The video decoder 3801 thus reads and 
decodes the data from the video buffer, the sub-picture decoder 31 00 reads and decodes the data from the sub-picture 
buffer, and the audio decoder 3200 reads and decodes the data from the audio buffer. 

[0744] This stream buffer data decoding process then terminates when these decoding processes are completed. 
[0745] The decoder synchronization process of step #31114, Fig. 67, is described below with reference to Fig. 68. 
15 This processes comprises steps #31120, #31122, and #31124 

[0746] At step #31 120 it is determined whether a seamless connection is specified between the current cell and the 
preceding cell. If a seamless connection, the procedure moves to step #31122, if not, the procedure moves to step 
#31124. 

[0747] A process synchronizing operation for producing seamless connections is executed in step #31122, and a 
20 process synchronizing operation for non-seamless connections is executed in step #31124. 

[0748] By means of the present invention thus described it is possible to read and supply to a decoder the source 

data for plural video objects without interruptions in the data during reproduction. When reproducing plural video objects 

of the same length, it is also possible to seamlessly reproduce the video objects without intermitting the data or the 

presentation time even from a midpoint in a video object. 
25 [0749] It is also possible to supply the data of plural video objects to the decoder without intermitting the data during 

presentation. Plural video objets can also be reproduced by reading only the necessary information and seamlessly 

switching the presented video object without intermitting the data during presentation. 

[0750] Switching between plural video objects can also be accomplished at high speed even while presentation is 
in process. Furthermore, it is possible to dynamically switch to another system stream according to a user selection 
30 even while in the middle of video object presentation when reproducing data from an optical disk, and these dynamic 
system stream changes can be executed seamlessly. 

Industrial Applicability 

35 [0751 ] As is apparent from a method and an apparatus according to the present invention for interleaving a bitstream 
to record the interleaved bitstream to a recording medium and reproduce the recorded bitstream therefrom is suitable 
for the application of an authoring system which can generate a new title by editing a title constructed by bitstreams 
carrying various information in accordance with the user's request, and is also suitable for a Digital Video Disk System, 
or DVD System being developed recently. 

40 

Claims 

1 . A bit stream generating method for generating a bit stream which is to be stored on an optical disc, said bit stream 
45 comprising a plurality of video objects (VOB) containing compressed video data, one of said video objects (VOB) 

is to be alternatively reproducible from among m video objects (VOB), said bit stream generating method being 
characterized by the steps of: 

providing m video objects (VOB), wherein m > 1 , 
so segmenting each of said m video objects (VOB) into v interleave units (ILVU), wherein v > 1 , 

determining a read rate Vr at which a track buffer reads data from the disc for temporarily storing the data in 
the track buffer, 

determining a supply rate Vo at which data are transferred to a decoder, 

obtaining a data quantity d1 stored to the track buffer at a differential rate Vr - Vo during a time period t1 , said 
55 time period t1 being a period during which an interleave unit (ILVU) is read from disc, 

interleaving said interleave units (ILVU) in a sequence 



ILVU 11 ..ILVU 21 ....ILVU (m . 1)1 ..ILVU m1 . 
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ILVU 12 ..ILVU 22 ....iLVU (m . 1)2 ..ILVU m2 .. 



ILVU^j JLVU 2(V _ 1} .... ILVU^^jJLVU^,.. 
ILVU 1v ..ILVU 2v ....ILVU (m . 1)v ..ILVU mv .. 

in such a way that said data quantity d1 exceeds a data quantity 62, said data quantity 62 being the data 
quantity which is transferred at a supply rate Vo from the track buffer to the decoder during a time period t2 at 
which reading a bit stream from the disc is to be interrupted when an optical head of the disc reproducing apparatus 
jumps between an interleave unit ILVUy and an interleave unit ILVUjQ +1 j, 

where i is a sequential index indicative of a video object (VOB) and ranges from 1 to m, and j is a sequential 
index indicative of an interleave unit (ILVU) and ranges from 1 to v. 

A bit stream generating method according to claim 1 , 

wherein a video object (VOB) contains compressed image data, and 

the step for selecting a video object (VOB) selects m video objects (VOB) having the same presentation time 
for the uncompressed image data contained therein. 

A bit stream generating method according to claim 1 , 

wherein said video object (VOB) contains image data compressed according to a MPEG standard, and a 
smallest control unit (VOBU) which is a sub region of said video object (VOB) constituting an interleave unit (ILVU) 
is bounded by a GOP unit, which is an image data compression unit. 

An optical disc (m) which stores a bit stream containing a plurality of video objects (VOB) containing compressed 
video data along a spiral track, 

characterized in that the bit stream has a data structure provided by a method according to one of the 
preceding claims. 

A disc (m) according to claim 4, wherein said interleave unit (ILVU) comprises one or more sub regions (VOBU) 
to which video data and management information are stored, and said management information comprises 

end address information (VOBU_EA) of the associated sub region (VOBU), and 

a flag (UNIT_END_flag) indicating whether the interleave unit terminates in the associated sub region. 

A disc (m) according to claim 4, wherein m video objects (VOB) associated with a same group contain compressed 
image data having the same presentation time when uncompressed, said interleave unit (ILVU) comprises one or 
more sub regions (VOBU) to which video data and management information are stored, and said management 
information comprises 

end address information (VOBU_EA) indicative of an end address of the associated sub region on disc, 
a flag (UNIT_END_flag) indicative of an end of an interleave unit, 

linking information (NTJLVU_SA) indicative of a position on the disc (m) of a next interleave unit (ILVU) to be 
reproduced, and 

angle-specific linking information (SML_AGL_C#1-9_DSTA) indicative of a position on the disc (m) of a next 
interleave unit (ILVU) to be reproduced for each video object (VOB) associated with a single GOP during 
alternative presentation. 

A disc (m) according to claim 4, wherein said interleave unit (ILVU) comprises one or more sub regions (VOBU) 
to which video data and management information are stored, wherein sub region (VOBU) image data is image 
data compressed using a MPEG method, and 

said sub region (VOBU) is bounded by a GOP unit, which is an image data compression unit. 

A disc reproduction apparatus (DCD) comprising 

a means for reading a bit stream including presentation sequence information, group information and video 
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objects (VOB) from a disc (m), the bit stream having a data structure provided by a method according to one 
of claims 1 to 3, 

a retrieving means for retrieving management information (NV) from a read video object (VOB), 
a buffer for storing the management information (NV), 
5 a presentation control means for controlling video object presentation according to a presentation sequence 

indicated by the presentation sequence information, 

an angle presentation detection means for detecting whether a read video object is an alternatively presentable 
video object based on the group information, 

an angle selection means for selecting one video object when a read video object is an alternatively presentable 
10 (- video object, 

an interleave unit end detection means for referencing end address information (ILVU_EA) for an interleave 
unit contained in the management information of a sub region being presented, and detecting whether pres- 
entation of an interleave unit has reached the end, and 

an angle presentation control means for presenting a selected video object when a read video object is an 
15 alternatively presentable video object, and when presentation has reached an end of an interleave unit, re- 

suming presentation of a video object from an interleave unit indicated by linking information (NTJLVILSA) 
in the stored management information (NV). 

9. A disc reproduction apparatus (DCD) according to claim 8, further comprising a means for reading presentation 
20 sequence information, group information, and a video object (VOB) from a disc (m), 

wherein the interleave unit end detection means detects whether presentation of a video object (VOB) has 
reached the end of an interleave unit based on end address information (VOBU_EA) for a sub region on disc, and 
a flag (UNIT_END_flag) indicative of an end of an interleave unit. 

25 10. A disc reproduction apparatus (DCD) according to claim 8, further comprising an angle change acceptance means 
and an angle change control means, 

wherein said reading means reads presentation sequence information, group information, and a video object 
(VOB) from a disc (m). 

30 said angle change acceptance means accepts an instruction changing a video object (VOB) to be alternatively 

presented, and determines a presentation start position for the alternatively selected video object according 
to the angle-specific linking information (SML_AGL_C#1-9_DSTA) in the management information (NV), and 
said angle change control means starts presentation of the specified alternatively presentable video object 
from a determined presentation starting position when an angle change instruction is received. 

35 

11. A reproduction method for a disc comprising the steps of 

reading a bit stream from the disc, the bit stream having a data structure provided by a method according to 
one of claims 1 to 3, 
40 selecting a video object to be presented, 

reading the selected video object, 

retrieving and storing management information (NV) from the read video object, 
presenting the read video object, 

detecting whether presentation of a video object has continued to an end of an interleave unit based on 

45 

a) interleave unit end address information (ILVU_EA) and/or 

b) end address information (VOBU_EA) indicative of an end address of the associated sub region on disc, 
and a flag (UNIT_END_flag) indicative of an end of an interleave unit, 

contained in the management information (NV), and 

50 

resuming video object presentation from a position indicated by a linking information (NT_ILVU_SA) in the 
stored management information when video object presentation has been completed to an interleave unit end. 

12. A reproduction method for a disc according to claim 11 , further comprising the steps of: 

55 

accepting an instruction for selecting a video object (VOB) to be alternatively presented, 
determining a presentation start position for the alternatively selected video object according to the angle- 
specific linking information (SML_AGL_C#1-9_DSTA) in the management information (NV) when a video ob- 
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ject change instruction is accepted, and 

starting presentation of the alternatively selected video object from a determined presentation starting position 
when an angle change instruction is received. 

5 

Patentanspruche 

1. Bitstrom-Erzeugungsverfahren zum Erzeugen eines Bitstromes, welcher auf einer optischen Disk zu speichem 
ist, wobei der Bitstrom eine Mehrzahl von Videoobjekten (VOB) mit komprimierten Videodaten umfasst, wobei 
10 eines der Videoobjekte (VOB) alternativ aus m Videoobjekten (VOB) wiedergebbar ist, wobei das Bitstrom-Erzeu- 

gungsverfahren gekennzeichnet ist durch die Schritte: 

Bereitstellen von m Videoobjekten (VOB), mit m > 1, 

Segmentieren von jedem der m Videoobjekte (VOB) in v Verschachtelungseinheiten (ILVU), mit v > 1 , 
is Bestimmen einer Lesegeschwindigkeit Vr, mit welcher ein Spurpuffer Daten von der Disk liest, zum voruber- 

gehenden Speichern der Daten in dem Spurpuffer, 

Bestimmen einer Liefergeschwindigkeit Vo, mit welcher Daten zu einem Dekodierer ubertragen werden, 
Erhalten einer in dem Spurpuffer wahrend eines Zeitabschnittes t1 mit einer Differenzgeschwindigkeit Vr-Vo 
gespeicherten Datenmenge d1 , wobei derZeitabschnitt t1 eine Periode ist, wahrend welcher eine Verschach- 
20 telungseinheit (ILVU) von der Disk gelesen wird, 

Verschachteln der Verschachtelungseinheiten (ILVU) in einer Folge 

ILVU^JLVU^.^ILVU^^JLVU,, 

25 iLVU 12 ..ILVU 22 ....ILVU (m . 1)2 ..ILVU m2 



ILVU 1(v . 1) ..ILVU 2(v . 1) ....ILVU (m . 1)(v . 1) ..ILVU m(v . 1) 

30 

ILVU^JLVU^^ILVU^.^JLVU^ 

in solch einer Weise, dass die Datenmenge d1 eine Datenmenge d2 uberschreitet, wobei die Datenmenge d2 die 
Datenmenge ist, welche mit einer Liefergeschwindigkeit Vo von dem Spurpuffer zu dem Dekodierer wahrend eines 
35 Zeitabschnittes t2 ubertragen wird, wahrend welchem das Lesen eines Bitstromes von der Disk zu unterbrechen 

ist, wenn ein optischer Kopf der Disk-Wiedergabevorrichtung zwischen einer Verschachtelungseinheit ILVUy und 
einer Verschachtelungseinheit ILVU j( j +1) springt, 

wobei i ein sequenzieller Index ist, welcher ein Videoobjekt (VOB) angibt und von 1 bis m reicht, und j ein sequen- 
zieller index ist, der eine Verschachtelungseinheit (ILVU) angibt und von 1 bis v reicht. 

40 

2. Bitstrom-Erzeugungsverfahren nach Anspruch 1 , 

bei welchem ein Videoobjekt (VOB) komprimierte Bilddaten enthalt, und 

der Schritt zum Auswahlen eines Videoobjektes (VOB) m Videoobjekte (VOB) mit der gleichen Darstellungszeit 
45 fur die darin enthaltenen unkomprimierten Bilddaten auswahlt. 

3. Bitstrom-Erzeugungsverfahren nach Anspruch 1, 

bei welchem das Videoobjekt (VOB) entsprechend einem MPEG-Standard komprimierte Bilddaten enthalt, und 
eine kleinste Steuerungseinheit (VOBU), welche eineTeil-Region des Videoobjektes (VOB) ist, welche eine Ver- 
so schachtelungseinheit (ILVU) bildet, welche durch eine GOP-Einheit gebunden ist, welche eine Bilddaten-Kom- 
pressionseinheit ist. 

4. Optische Disk (m), welche einen Bitstrom mit einer Mehrzahl von Videoobjekten (VOB) mit komprimierten Vide- 
odaten entlang einer spiralformigen Spur speichert, 

55 dadurch gekennzeichnet, dass der Bitstrom eine Datenstruktur aufweist, welche durch ein Verfahren nach einem 

der vorstehenden Anspruche vorgegeben ist. 

5. Disk (m) nach Anspruch 4, bei welcher die Verschachtelungseinheit (ILVU) eine Oder mehrere Teil-Regionen (VO- 
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BU) umfasst, in welchen Videodaten und Verwaltungsinformationen gespeichert sind, wobei die Verwaltungsin- 

formationen umfassen Endadress-lnformationen (VOBU_EA) der zugeordneten Teil-Region (VOBU) und 

ein Flag (UNIT_END_flag), welches angibt, ob die Verschachtelungseinheit in der zugeordneten Teil-Region endet, 

umfassen. 

Disk (m) nach Anspruch 4, bei welcher m Videoobjekte (VOB), welche einer gleichen Gruppe zugeordnet sind, 
komprimierte Bilddaten enthalten, mit der gleichen Darstellungszeit, wenn sie unkomprimiert sind, wobei die Ver- 
schachtelungseinheit (ILVU) eine Oder mehrere Teil-Regionen (VOBU) umfasst, in welchen Videodaten und Ver- 
waltungsinformationen gespeichert sind, und wobei die Verwaltungsinformationen 

Endadress-lnformationen (VOBLLEA), welche eine Endadresse der zugeordneten Teil-Region auf der Disk 
angeben, 

ein Flag (UNIT_END_flag), welches ein Ende einer Verschachtelungseinheit anzeigt, Verkn upfungs inform a- 
tionen (NTJLVU_SA), welche eine Position einer nachsten wiederzugebenden Verschachtelungseinheit (IL- 
VU) auf der Disk (m) angeben, und 

Winkel-spezifische Verknupfungsinformationen (SML_AGL_C#1-9_DSTA), welche eine Position einemach- 
sten wiederzugebenden Verschachtelungseinheit (ILVU) auf der Disk (m) fur jedes einer einzelnen GOP zu- 
geordneten Videoobjekt (VOB) wahrend einer alternativen Darstellung angibt, umfassen. 

Disk (m) nach Anspruch 4, bei welcher die Verschachtelungseinheit (ILVU) eine oder mehrere Teil-Regionen (VO- 
BU) umfasst, in welchen Videodaten und Verwaltungsinformationen gespeichert sind, wobei die Bilddaten in der 
Teil-Region (VOBU) unter Verwendung eines MPEG-Verfahrens komprimierte Bilddaten sind und 
die Teil-Region (VOBU) durch eine GOP-Einheit gebunden ist, welche eine Bilddaten-Kompressionseinheit ist. 

Disk-Wiedergabevorrichtung (DCD), mit 

einer Einrichtung zum Lesen eines Bitstromes mit einer Darstellungssequenz-lnformation, Gruppeninforma- 
tionen und Videoobjekten (VOB) von einer Disk (m), wobei der Bitstrom einen durch ein Verfahren nach einem 
der Anspruche 1 bis 3 vorgegebenen Datenaufbau aufweist, 

einer Abfrage-Einrichtung zum Abfragen von Verwaltungsinformationen (NV) von einem gelesenen Videoob- 
jekt (VOB), 

einem Puffer zum Speichern der Verwaltungsinformationen (NV), 

einer Darstellungs-Steuerungseinrichtung zum Steuern der Vldeoobjekt-Darstellung 

gemaB einer durch die Darstellungssequenz-lnformation angegebenen Darstellungssequenz, 

einer Winkel-Darstellungs-Erfassungseinrichtung zum Erfassen, ob ein gelesenes Videoobjekt ein alternativ 

darstellbares Videoobjekt ist, basierend auf den Gruppeninformationen, 

einer Winkel-Auswahleinrichtung zum Auswahlen eines Videoobjektes, wenn ein gelesenes Videoobjekt ein 
alternativ darstellbares Videoobjekt ist, 

einer Verschachtelungseinheiten-End-Erfassungseinrichtung zum Referenzieren einer End-Adressinformati- 
on (ILVU_EA) fur eine Verschachtelungseinheit, enthalten in der Verwaltungsinformation einer dargestellten 
Teil-Region und zum Erfassen, ob eine Darstellung einer Verschachtelungseinheit das Ende erreicht hat, und 
einer Winkel-Darstellungs-Steuerungseinrichtung zum Darstellen eines ausgewahlten Videoobjektes, wenn 
ein gelesenes Videoobjekt ein alternativ darstellbares Videoobjekt ist, und, wenn die Darstellung ein Ende 
einer Verschachtelungseinheit erreicht hat, zur Wiederaufnahme der Darstellung eines Videoobjektes von 
einer Verschachtelungseinheit, angezeigt durch Verknupfungsinformationen (NT_IL-VU_SA) in der gespei- 
cherten Verwaltungsinformation (NV). 

Disk-Wiedergabevorrichtung (DCD) nach Anspruch 8, weiter mit einer Einrichtung zum Lesen von Darstellungs- 

sequenz-lnformationen, Gruppeninformationen und einem Videoobjekt (VOB) von einer Disk (m), 

wobei die Verschachtelungseinheiten-End-Erfassungseinrichtung erfasst, ob die Darstellung eines Videoobjektes 

(VOB) das Ende einer Verschachtelungseinheit erreicht hat, basierend auf der Endadress-lnformation (VOBU_EA) 

fur eine Teil-Region auf der Disk, und ein Flag (UNIT_END_flag), welches ein Ende einer Verschachtelungseinheit 

angibt. 

Disk-Wiedergabevorrichtung (DCD) nach Anspruch 8, mit einer Winkel-Anderungs-Akzeptanz-Einrichtung und ei- 
ner Winkel-Anderungs-Steuerungseinrichtung, wobei die Leseeinrichtung Darstellungssequenz-lnformationen, 
Gruppeninformationen und ein Videoobjekt (VOB) von einer Disk (m) Nest, 
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wobei die Winkel-Anderungs-Akzeptanz-Einrichtung eine Anweisung akzeptiert, welche ein alternativ darzu- 
stellendes Videoobjekt (VOB) andert, und eine Darstellungs-Anfangsposition fur das alternativ ausgewahlte 
Videoobjekt gemaB der Winkel-spezifischen Verknupfungsinformation (SML_AGL_C#1-9_DSTA) in der Ver- 
waltungsinformation (NV) bestimmt, und 
5 wobei die Winkel-Anderungs-Steuerungseinrichtung mit der Darstellungs des bestimmten, alternativ darstell- 

baren Videoobjektes von einer bestimmten Darstellungs-Anfangsposition aus beginnt, wenn eine Winkelan- 
derungs-Anweisung empfangen wird. 

11 . Wiedergabeverfahren fur eine Disk, mit den Schritten: 

10 

Lesen eines Bitstromes von der Disk, wobei der Bitstrom einen durch ein Verfahren nach einem der Anspruche 
1 bis 3 vorgegebenen Datenaufbau aufweist, 
Auswahlen eines darzustellenden Videoobjektes, 
Lesen des ausgewahlten Videoobjektes, 
15 Abfragen und Speichern von Verwaitungsinformationen (NV) von dem gelesenen Videoobjekt, 

Darstellen des gelesenen Videoobjektes, 

Erfassen, ob die Darsteilung eines Videoobjektes bis zu einem Ende der Verschachteiungseinheit fortgesetzt 
wurde, basierend auf 

20 a) Verschachtelungseinheiten-Endadress-lnformationen (ILVU_EA) und/oder 

b) Endadress-lnformationen (VOBU_EA), welche eine Endadresse derzugeordneten Teil-Region auf der 
Disk anzeigen, und einem Flag (UNIT_END_flag), welches ein Ende einer Verschachteiungseinheit an- 
zeigt, 

enthalten in der Verwaltungsinformation (NV), und 

25 

Wiederaufnahme der Videoobjekt-Darstellung von einer durch eine Verknupfungsinformation (NTJLVU_SA) 
in der gespeicherten Verwaltungsinformation angezeigten Position, wenn die Videoobjekt-Darstellung bei ei- 
nem Verschachtelungseinheiten-Ende abgeschlossen wurde. 

30 12. Wiedergabeverfahren fur eine Disk nach Anspruch 11 , weiter mit den Schritten: 

Akzeptieren einer Anweisung zum Auswahlen eines alternativ darzustellenden Videoobjektes (VOB), 
Bestimmen einer Darstellungs-Anfangsposition fur das alternativ ausgewahlte Videoobjekt gemaf3 der Winkel- 
spezifischen Verknupfungsinformation (SML_AGL_C#1-9_DSTA) in der Verwaltungsinformation (NV), wenn 
35 eine Video-objekt-Wechsel-Anweisung akzeptiert wird, und 

Beginnen der Darsteilung des alternativ ausgewahlten Videoobjektes von einer vorbestimmten Darstellungs- 
Anfangsposition, wenn eine Winkel-Anderungs-Anweisung empfangen wird. 

40 R eve ridi cations 

1. Procede de generation d'un flux de bits destine a generer un flux de bits qui doit etre memorise sur un disque 
optique, ledit flux de bits comprenant une pluralite d'objets video (VOB) contenant des donnees video compres- 
sees, Tun desdits objets video (VOB) devant etre reproduit en alternance parmi m objets video (VOB), ledit procede 
45 de generation de flux de bits etant caracterise par les etapes consistant a : 

foumir m objets video (VOB), dans lequel m > 1 , 

segmenter chacun desdits m objets video (VOB) en v unites d'entrelacement (ILVU), dans lequel v > 1 , 
determiner un debit de lecture Vr auquel une memoire tampon de piste lit les donnees a partir du disque af in 

50 de memoriser temporairement les donnees dans la memoire tampon de piste, 

determiner un debit d'alimentation Vo auquel des donnees sont transferees vers un decodeur, 
obtenir une quantite de donnees d1 memorisees dans la memoire tampon de piste a un debit differentiel Vr- 
Vo durant un intervalle de temps t1 , ledit intervalle de temps t1 etant une periode durant laquelle une unite 
d'entrelacement (ILVU) est lue a partir du disque, 

5 5 entrelacer lesdites unites d'entrelacement (ILVU) suivant une sequence 

ILVU„ ...ILVU 21 ... ILVU^.JLVU^. 
ILVU 12 ...ILVU 22 ... ILVU (m . 1)2 ..ILVU m2 . 
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ILVU 1(M) .. ILVU 2(V . 1} .... ILVU^,^, ILVU mM) .. 
ILVU 1v ... ILVU*... ILVU (m . 1)v .. ILVU mv . 

de telle maniere que ladite quantite de donnees d1 depasse une quantite de donnees 62 , ladite quantite de donnees 
62 etant la quantite de donnees qui est transferee a un debit d'alimentation Vo depuis la memoire tampon de piste 
vers le decodeur durant un intervalle de temps t2 auquel la lecture d'un flux de bits a partir du disque doit etre 
interrompue lorsqu'unetete optique de I'appareil de reproduction de disque saute entre une unite d'entrelacement 
ILVUjj et une unite d'entrelacement ILVUjQ +1 ), 

ou i est un index sequentiel indicatif d'un objet video (VOB) etvade 1 am, etj estun index sequentiel indicatif 
d'une unite d'entrelacement (ILVU) et va de 1 a v. 

Precede de generation d'un flux de bits selon la revendication 1 , 

dans lequel un objet video (VOB) contient des donnees d'image compressees, et 

I'etape de selection d'un objet video (VOB) selectionne m objets video (VOB) ayant le meme temps de pre- 
sentation pour les donnees d'image non compressees contenues dans ceux-ci. 

Procede de generation d'un flux de bits selon la revendication 1 , 

dans lequel ledit objet video (VOB) contient des donnees d'image compressees conformement a un standard 
MPEG, et une unite de commande la plus petite (VOBU) qui est une sous-region dudit objet video (VOB) constituant 
une unite d'entrelacement (ILVU) est limitee par une unite d'un groupe d'images GOP qui constitue une unite de 
compression de donnees d'image. 

Disque optique (m) qui memorise un flux de bits contenant une pluralite d'objets video (VOB) contenant des objets 
video compresses le long d'une piste en spirale, 

caracterise en ce que le flux de bits presente une structure de donnees fournie par un procede selon I'une 
des revendications precedentes. 

Disque (m) selon la revendication 4, dans lequel ladite unite d'entrelacement (ILVU) comprend une ou plusieurs 
sous-regions (VOBU) dans lesquelles des donnees video et des informations de gestion sont memorisees, et 
lesdites informations de gestion comprennent 

des informations d'adresse de fin (VOBU_EA) de la sous-region associee (VOBU), et 

un indicateur (UNIT_END_flag) indiquant si I'unite d'entrelacement se termine dans la sous-region associee. 

Disque (m) selon la revendication 4, dans lequel m objets video (VOB) associes a un meme groupe contiennent 
des donnees d'image compressees ayant le meme temps de presentation lorsqu'elles ne sont pas compressees, 
ladite unite d'entrelacement (ILVU) comprend une ou plusieurs sous-regions (VOBU) dans lesquelles des donnees 
video et des informations de gestion sont memorisees, et lesdites informations de gestion comprennent 

des informations d'adresse de fin (VOBU_EA) indicatives d'une adresse de fin de la sous-region associee sur 
un disque, 

un indicateur (UNIT_END_flag) indicatif d'une extremite d'une unite d'entrelacement, 
des informations de chamage (NT_ILVU_SA) indicatives d'une position sur le disque (m) d'une unite d'entre- 
lacement suivante (ILVU) devant etre reproduce, et 

des informations de chamage specifiques a un angle (SML_AGL_C#1 -9_DSTA) indicatives d'une position sur 
le disque (m) d'une unite d'entrelacement suivante (ILVU) devant etre reproduite pour chaque objet video 
(VOB) associe a un seul groupe GOP durant une presentation differente. 

Disque (m) selon la revendication 4, dans lequel ladite unite d'entrelacement (ILVU) comprend une ou plusieurs 
sous-regions (VOBU) dans lesquelles des donnees video et des informations de gestion sont memorisees, dans 
lequel des donnees d'image d'une sous-region (VOBU) sont des donnees d'image compressees utilisant un pro- 
cede de type MPEG, et 

ladite sous-region (VOBU) est limitee par une unite de groupe GOP, qui est une unite de compression de 
donnees d'image. 

Appareil de reproduction de disque (DCD) comprenant 
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un moyen destine a lire un flux de bits comprenant des informations de sequence de presentation, des infor- 
mations de groupe et des objets video (VOB) a partir d'un disque (m), le flux de bits presentant une structure 
de donnees fournie par un precede selon Tune des revendications 1 a 3, 

un moyen de recuperation destine a recuperer des informations de gestion (NV) a partir d'un objet video lu 
5 (VOB), 

une memoire tampon destinee a memoriser des informations de gestion (NV), 

un moyen de commande de presentation destine a commander une presentation d'objet video conformement 
a une sequence de presentation indiquee par les informations de sequence de presentation, 
un moyen de detection de presentation selon un angle destine a detecter si un objet video lu est un objet video 
. 10 pouvant etre presente de fagon differente sur la base des informations de groupe, 

un moyen de selection d'angle destine a selectionner un objet video lorsqu'un objet video lu est un objet video 
pouvant etre presente de fagon differente, 

un moyen de detection de fin d'unite d'entrelacement destine a referencer des informations d'adresse de fin 
(ILVUJEA) pour une unite d'entrelacement contenue dans les informations de gestion d'une sous-region qui 

15 est presentee, et detecter si une presentation d'une unite d'entrelacement a atteint la fin, et 

un moyen de commande de presentation suivant un angle destine a presenter un objet video selectionne 
lorsqu'un objet video lu est un objet video pouvant etre presente de f agon differente, et lorsque la presentation 
a atteint une extremite d'une unite d'entrelacement, reprendre la presentation d'un objet video a partir d'une 
unite d'entrelacement indiquee par des informations de chamage (NT_ILVU_SA) dans les informations de 

20 gestion memorisees (NV). 

9. Appareil de reproduction de disque (DCD) selon la revendication 8, comprenant en outre un moyen destine a lire 
des informations de sequence de presentation, des informations de groupe et un objet video (VOB) a partir d'un 
disque (m), 

25 dans lequel le moyen de detection de fin d'unite d'entrelacement detecte si la presentation d'un objet video 

(VOB) a atteint la fin d'une unite d'entrelacement sur la base d'informations d'adresse de fin (VOBU_EA) pour une 
sous-region sur un disque, et un indicateur (UNIT_END_flag) indicatif d'une fin d'une unite d'entrelacement. 

10. Appareil de reproduction de disque (DCD) selon la revendication 8, comprenant en outre un moyen d'acceptation 
30 de changement d'angle et un moyen de commande de changement d'angle, 

dans lequel ledit moyen de lecture lit des informations de sequence de presentation, des informations de 
groupe, et un objet video (VOB) a partir d'un disque (m), 

ledit moyen d'acceptation de changement d'angle accepte une instruction changeant un objet video (VOB) 
35 devant etre presente de fagon differente, et determine une position de debut de presentation pour I'objet video 

selectionne de fagon differente conformement aux informations de chamage specifiques a un angle 
(SML_AGL_C#1-9_J)STA) dans les informations de gestion (NV), et 

ledit moyen de commande de changement d'angle debute une presentation de I'objet video pouvant etre 
presentee de fagon differente specif iee a partir d'une position de debut de presentation determinee lorsqu'une 
40 instruction de changement d'angle est regue. 

11. Precede de reproduction destine a un disque comprenant les etapes consistant a 

lire un flux de bits a partir du disque, le flux de bits presentant une structure de donnees foumie par un precede 
45 selon I'une des revendications 1 a 3, 

selectionner un objet video a presenter, 
lire I'objet video selectionne, 

recuperer et memoriser des informations de gestion (NV) a partir de I'objet video lu, 
presenter I'objet video lu, 

50 detecter si une presentation d'un objet video s'est poursuivie jusqu'a une fin d'une unite d'entrelacement sur 

la base 

a) d'informations d'adresse de fin d'unite d'entrelacement (ILVU_EA), et/ou 

b) d'informations d'adresse de fin (VOBU_EA) indicatives d'une adresse de fin de la sous -region associee 
55 sur un disque, et un indicateur (UNIT_END_flag) indicatif d'une fin d'une unite d'entrelacement, contenues 

dans les informations de gestion (NV), et 



reprendre une presentation d'objet video a partir d'une position indiquee par des informations de chamage 
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(NTJLVU_SA) dans les informations de gestion memorisees lorsqu'une presentation d'objet video a ete ache- 
vee a une fin d'unite d'entrelacement. 

12. Precede de reproduction destine a un disque selon la revendication 1 1 , comprenant en outre les etapes consistant 

a: 

recevoir une instruction destinee a selectionner un objet video (VOB) devant etre presente d'une facon diffe- 
rente, 

determiner une position de debut de presentation pour I'objet video selection ne de facon differente conforme- 
ment aux informations de chainage specifiques a un angle (SML_AGL_C#1-9_DSTA) dans les informations 
de gestion (NV) lorsqu'une instruction de changement d'objet video est regue, et 

debuter une presentation de I'objet video selectionne de facon differente a partir d'une position de debut de 
presentation determinee lorsqu'une instruction de changement d'angle est recue. 
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Fig. 7 
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Fig. 13 
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Setting of decode table 
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VOB_A_STP_PTM2_reg, 
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VOB_A_STP_PTM2_reg, 
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